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QUANTITATIVE ASSAY FOR DETECTION OF NEWLY SYNTHESIZED RNA IN A 
CELL-FREE SYSTEM AND IDENTIFICATION OF RNA SYNTHESIS INHIBITORS 

BACKGROUND 

[0001] HCV is one of the most prevalent causes of chronic liver disease in the United 
States, which accounts for about 15 percent of acute viral hepatitis, 60 to 70 percent of chronic 
hepatitis, and up to 50 percent of cirrhosis, end-stage liver disease, and liver cancer. Almost 4 
million Americans, or 1.8 percent of the U.S. population, have antibodies to HCV (i.e., anti-HCV 
antibodies), indicating ongoing or previous infection with the virus. Hepatitis C causes an 
estimated 8,000 to 10,000 deaths annually in the United States. While the acute phase of HCV 
infection is usually associated with mild symptoms, some evidence suggests that only about 15% 
to 20% of the infected people will clear HCV. 

[0002] HCV is a small, enveloped, single-stranded positive RNA virus in the Flaviviridae 
family. The genome is approximately 10,000 nucleotides and encodes a single polyprotein of 
about 3,000 amino acids. All of the protein products of HCV are produced by proteolytic 
cleavage of the polyprotein, carried out by one of three proteases: the host signal peptidase, the 
viral self-cleaving metalloproteinase, NS2, and the viral serine protease NS3/4A. The combined 
action of these enzymes produces the structural proteins (C, El and E2) and non-structural 
proteins (NS2, NS3, NS4A, NS4B, NS5A, and NS5B) that are required for replication and 
packaging of the viral genomic RNA. NS5B is the viral RNA-dependent RNA polymerase 
(RDRP) that is responsible for the conversion of the input genomic RNA into a minus strand 
copy (complimentary RNA, or cRNA); the cRNA then serves as a template for transcription by 
NS5B of the positive sense genomic/messenger RNA. The HCV replicase is the complex of 
proteins that are necessary for the accurate and efficient synthesis of viral replicon RNA. 

[0003] Currently, the only effective therapy against HCV is alpha-interferon, which 
reduces the amount of virus in the liver and blood (e.g., viral load) in only a small proportion of 
infected patients. Standard forms of interferon, however, are now being replaced by pegylated 
interferons (peginterferons), alpha interferons that have been modified chemically by the 
addition of a large inert molecule of polyethylene glycol. At the present time, the optimal 
regimen including interferon appears to be a 24- or 48-week course of a combination of 
pegylated alpha interferon and the nucleoside ribavirin, an oral antiviral agent that has activity 
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against a broad range of viruses. Nonetheless, response rates to the combination 
interferon/ribavirin therapy may be moderate for certain HCV genotypes, i.e., a response rate of 
about 50% to about 60%, although response rates for selected genotypes of HCV (notably 
genotypes 2 and 3) are typically higher. Another drawback to the current therapy is that there are 
often significant adverse side effects associated with each of these agents including, for example, 
flu-like symptoms; bone marrow suppressive effects; neuropsychiatry effects such as marked 
irritability, anxiety, personality changes, depression, and even suicide or acute psychosis; 
histamine-like side effects; and anemia. 

[0004] Taken together, the preceding facts indicate a significant need for effective small 
molecule inhibitors of HCV replication that do not suffer from the above-mentioned drawbacks. 
A particularly useful class of inhibitors of HCV, as well as other positive strand RNA viruses, is 
inhibitors of viral RNA synthesis. 

[0005] While accurate and efficient assays for identifying HCV RNA synthesis inhibitors 
may be useful tools for identifying effective small molecule HCV therapeutics, no such system 
has been developed. An in vitro replication assay using recombinant NS5B polymerase has been 
reported. However, in this system, the purified form of NS5B polymerase lacked template 
specificity and produced various lengths of RNA products. These phenomena are very different 
from HCV RNA replication in vivo. To better reflect the HCV RNA replication process in the 
cell, a cell-free HCV replication system was established using whole cell lysates or membrane 
fractions of cells expressing the HCV replicon. In this cell-free system, radioactive P 32 -UTP or 
P 32 -CTP was used to label newly synthesized HCV RNA, and then the reaction products were 
resolved by gel electrophoresis, followed by autoradiography. Because these assays require gel 
electrophoresis to separate the full length HCV RNA from other RNA molecules, the results are 
difficult to quantify, often inaccurate, and poorly reproducible. Additionally, while it appears to 
be clear that RNA elongation occurs in this cell-free system, there is no convincing evidence that 
de novo RNA initiation occurs in this system. 
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SUMMARY 

[0006] An assay for detecting newly synthesized RNA in a cell-free system, and for 
identifying compounds that inhibit positive strand RNA viruses such as the Hepatitis C Virus 
' (HCV) is provided herein. 

[0007] Provided herein is an assay for detecting newly synthesized RNA from a positive 
strand RNA virus, such as HCV virus, which in more efficient and quantitative than previously 
reported assays. This assay is useful for identifying inhibitors of RNA synthesis of a positive 
strand RNA virus, including inhibitors of HCV RNA synthesis. Certain embodiments of the 
assay include methods for distinguishing RNA synthesis initiation inhibitors from RNA 
elongation inhibitors. 

[0008] Provided herein is a method for determining whether a test compound inhibits 
RNA synthesis of a positive strand RNA virus. The method comprises: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

detecting the newly synthesized RNA population comprising the labeled nucleotide 

analog; 

quantitating the newly synthesized RNA population comprising the labeled nucleotide 
analog to provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of a newly synthesized 
RNA population comprising the labeled nucleotide analog produced in the absence of the test 
compound, wherein a decrease in the test RNA amount compared to the control RNA amount 
indicates that the test compound inhibits RNA synthesis of the positive strand RNA virus. 

[0009] Further provided herein is a method for quantitating newly initiated RNA of a 
positive strand RNA virus comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, and a labeled nucleotide analog, 
under conditions sufficient for in vitro RNA synthesis, to form a newly synthesized RNA 
population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
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to at least a portion of a transcription initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; and 

quantitating the protected RNA population comprising the labeled nucleotide analog. 

[0010] Also provided is a method for determining whether a test compound is an RNA 
synthesis initiation inhibitor of a positive strand RNA virus comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
to at least a portion of an initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; 

quantitating the protected RNA population comprising the labeled nucleotide analog to 
provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of protected RNA 
comprising the labeled nucleotide analog but produced in the absence of the test compound, 
wherein a decrease in the test RNA amount compared to the control RNA amount indicates that 
the test compound inhibits RNA synthesis initiation of the positive strand RNA virus. 
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BRIEF DESCRIPTION OF THE FIGURES 

[001 1] Figure 1 shows a schematic of the inhibition of HCV replicon replication by 
transcription initiation and elongation inhibitors. 

[0012] Figure 2 shows an autoradiogram illustrating use of an RNA synthesis inhibitor to 
demonstrate de novo initiation of RNA synthesis in isolated replication complexes. 

DETAILED DESCRIPTION 

[0013] Prior to setting forth the invention in detail, it may be helpful to provide 
definitions of certain terms to be used herein. Unless defined otherwise, all technical and 
scientific terms used herein have the same meaning as is commonly understood by one of skill in 
the art to which this invention belongs. 

Terminology and Molecular Description 

[0014] A "replicon" as used herein includes a genetic element, for example, a plasmid, 
cosmid, bacmid, phage or virus that is capable of replication largely under its own control. A 
replicon may be either RNA or DNA, and may be single or double-stranded. A preferred 
replicon is a replicon from a single-stranded positive strand RNA virus such as a virus of the 
Picornaviridae family, the Calciviride family, the Togaviridae family, the Coronaviridae family, 
or the Flaviviridae family. Viruses of the Flaviviridae family include, for example, Hepatitis C 
Virus, West Nile Virus, Dengue Virus, Kunjuin Virus, Yellow Fever Virus, Bovine Viral 
Diarrhea Virus, Tick Born Encephalitis Virus, Japanese Encephalitis Virus, and Venezuelan 
Equine Encephalitis Virus. 

[0015] In positive strand RNA viruses, replication is performed by a multi-protein-RNA 
complex called a "replicase complex". As used herein, a "replicase complex" is an active 
complex of polypeptides and RNA which is capable of complete and accurate viral replicon 
RNA synthesis under cell-free conditions suitable for viral RNA replication. By complete and 
accurate viral replicon RNA synthesis, it is meant that the replicase complex is capable of 
producing full-length viral replicon RNAs. In addition, the isolated replicase complex should 
show specificity for replication of the replicase RNA of the corresponding positive strand RNA 
virus. An isolated replicase complex preferably comprises a viral replicon template RNA for the 
positive strand RNA virus. An "isolated replicase complex" is a replicase complex which has 
been removed from its cellular environment, such as a cell expressing a viral replicon RNA. 
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"Isolated replicase complex" includes the membrane fraction of a cell expressing the viral 
replicase RNA. The isolated replicase complex may be separated from the cell nucleus, 
chromosomal DNA, and cytoplasmic materials, for example. In addition, an isolated replicase 
complex may comprise one or more polypeptides expressed from a recombinant expression 
system, so long as the complex is capable of complete and accurate viral replicon RNA 
synthesis. The replicase complex of HCV, for example, includes the NS5B protein which has 
RNA-dependent RNA polymerase activity, and other protein factors 

[0016] "Nucleic acid" or a "nucleic acid molecule" refers to a DNA or RNA molecule, 
either single or double-stranded and, if single-stranded, the molecule of its complementary 
sequence in either linear or circular form. A sequence or structure of a particular nucleic acid 
molecule can be described according to the normal convention of providing the sequence in the 
5' to 3' direction. 

[0017] The term "isolated nucleic acid molecule" includes nucleic acid molecules that 
are separated from an intact cellular environment. An "isolated nucleic acid molecule" may be, 
for example, a template RNA that is separated from the cell nucleus, chromosomal DNA, and 
other cellular materials which are not membrane-associated. Moreover, an "isolated" nucleic 
acid molecule, such as a viral replicon template RNA molecule, can be substantially free of other 
cellular material, or culture medium when produced by recombinant techniques, or substantially 
free of chemical precursors or other chemicals when chemically synthesized. In some 
embodiments, an isolated viral replicon template RNA may be purified as a portion of a 
membrane fraction of a cell expressing a viral replicon RNA. Such a membrane fraction may 
also comprise isolated replicase complexes. Substantially free of other cellular material includes, 
for example, a cellular fraction such as, for example, a membrane bound fraction. By 
substantially free of other cellular material, it is meant that an isolated nucleic acid molecule may 
be greater than or equal to about 70%, about 75%, about 80%, about 85%, about 90%, about 
95% or about 99% free of unwanted cellular materials, such as, for example, components of 
cellular fractions other than the membrane-bound fraction. 

[0018] In some embodiments, an "isolated" nucleic acid maybe free of sequences which 
naturally flank the nucleic acid (i.e., sequences located at the 5' and/or 3' ends of the nucleic 
acid) in the genomic DNA or RNA of the organism from which the nucleic acid is derived. For 
example, the isolated nucleic acid molecule can contain less than about 5 kb, about 4 kb, about 3 
kb, about 2 kb, about 1 kb, about 0.5 kb or about 0.1 kb of 5' and/or 3' nucleotide sequences 
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which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. In this sense, an isolated nucleic acid maybe, for example, a DNA 
vector encoding a viral replicon RNA which has been purified by standard DNA purification 
methods. 

[0019] "Natural allelic variants", "mutants" and "derivatives" of particular sequences of 
nucleic acids refer to nucleic acid sequences that are closely related to a particular sequence but 
which may possess, either naturally or by design, changes in sequence or structure. By closely 
related, it is meant that greater than or equal to about 75%, but often, greater than or equal to 
about 90%, of the nucleotides of the sequence match over the defined length of the nucleic acid 
sequence. Changes or differences in nucleotide sequence between closely related nucleic acid 
sequences may represent nucleotide changes in the sequence that arise during the course of 
normal replication or duplication in nature of the particular nucleic acid sequence. Other 
changes may be specifically designed and introduced into the sequence for specific purposes, 
such as to change an amino acid codon or sequence in a regulatory region of the nucleic acid. 
Such specific changes may be made in vitro using a variety of mutagenesis techniques or 
produced in a host organism placed under particular selection conditions that induce or select for 
the changes. Such sequence variants generated specifically may be referred to as "mutants" or 
"derivatives" of the original sequence. 

[0020] Different "variants" including "natural allelic variants" of, for example, the HCV 
genome exist in nature. These variants may be alleles characterized by differences in the 
nucleotide sequences of the gene coding for a protein, or may involve different RNA processing 
or post-translational modifications. The skilled person can produce variants having single or 
multiple amino acid substitutions, deletions, additions or replacements. These variants may 
include inter alia: a) variants in which one or more amino acids residues are substituted with 
conservative or non-conservative amino acids, b) variants in which one or more amino acids are 
added, and c) variants in which one or more amino acids include a substituent group. 

[0021] "Operatively linked" refers to a juxtaposition wherein the components so 
described are in a relationship permitting them to function in their intended manner. An 
expression control sequence operatively linked to a coding sequence is ligated such that 
expression of the coding sequence is achieved under conditions compatible with the expression 
control sequences. As used herein, the term "expression control sequences" refers to nucleic 
acid sequences that regulate the expression of a nucleic acid sequence to which it is operatively 
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linked. Expression control sequences are operatively linked to a nucleic acid sequence when the 
expression control sequences control and regulate the transcription and, as appropriate, 
translation of the nucleic acid sequence. Thus, expression control sequences can include 
appropriate promoters, enhancers, transcription terminators, a start codon (i.e., atg) in front of a 
protein-encoding gene, splicing signals for introns (if introns are present), maintenance of the 
correct reading frame of that gene to permit proper translation of the mRNA, and stop codons. 
The term "control sequences" is intended to include, at a minimum, components whose presence 
can influence expression, and can also include additional components whose presence is 
advantageous, for example, leader sequences and fusion partner sequences. Expression control 
sequences can include a promoter. By "promoter" is meant minimal sequence sufficient to direct 
transcription. Also included are those promoter elements which are sufficient to render promoter- 
dependent gene expression controllable for cell-type specific, tissue-specific, or inducible by 
external signals or agents; such elements may be located in the 5' or 3' regions of the gene. Both 
constitutive and inducible promoters are included. 

[0022] The term "probe" as used herein refers to an oligonucleotide or polynucleotide 
comprising either RNA or DNA, whether occurring naturally as in a purified restriction enzyme 
digest or produced synthetically, which is capable of annealing with or specifically hybridizing 
to a nucleic acid with sequences complementary to the probe. A probe may be, for example, a 
single-stranded RNA transcribed in vitro from a DNA template. A probe may be either single- 
stranded or double-stranded, but is preferably single-stranded. The exact length of the probe will 
depend upon many factors, including the temperature, the source of the probe, and use. The 
probes are selected to be "substantially" complementary to a strand of a particular target nucleic 
acid sequence. This means that the probes are sufficiently complementary so as to be able to 
"specifically hybridize" or anneal with their respective target strands under a set of pre- 
determined conditions. The term specifically hybridize means that the probe has a greater 
probability of hybridizing to its target sequence than other non-target sequences. 

[0023] Therefore, the probe sequence need not reflect the exact complementary sequence 
of the target. For example, a non-complementary nucleotide fragment may be attached to the 5' 
or 3' end of the probe, with the remainder of the probe sequence being complementary to the 
target strand. A small (e.g., less than or equal to about 10%) of mismatched nucleotides in the 
primer and the target sequence is also permissible. 
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[0024] The term "oligonucleotide" is defined as a nucleic acid molecule comprising two 
or more ribonucleotides or deoxyribonucleotides, preferably more than three. The exact size of 
the oligonucleotide will depend on various factors and on the particular application and use of 
the oligonucleotide. Preferably, an oligonucleotide comprises less than 100 nucleotides, more 
preferably less than or equal to about 50 nucleotides, and most preferably less than or equal to 
about 30 nucleotides. 

[0025] A "polynucleotide" probe or primer preferably comprises less than or equal to 
about 1000 nucleotides, more preferably less than or equal to about 800 nucleotides, and most 
preferably less than or equal to about 500 nucleotides. A polynucleotide primer also preferably 
comprises greater than or equal to about 100 nucleotides, more preferably greater than or equal 
to about 150 nucleotides, and most preferably greater than or equal to about 200 nucleotides. 

[0026] "Stringency conditions" for hybridization is a term of art which refers to the 
incubation and wash conditions, e.g., conditions of temperature and buffer concentration, which 
permit hybridization of a first nucleic acid to a second nucleic acid; the first nucleic acid may be 
perfectly (i.e., 100%) complementary to the second, or the first and second may share some 
degree of complementarity which is less than perfect (e.g., 70%, 75%, 85%, 95%, 98%). For 
example, certain high stringency conditions can be used which distinguish perfectly 
complementary nucleic acids from those of less complementarity. 

[0027] "High stringency conditions", "moderate stringency conditions" and "low 
stringency conditions" for nucleic acid hybridizations are explained in Current Protocols in 
Molecular Biology (Ausubel, F.M. et al., "Current Protocols in Molecular Biology", John Wiley 
& Sons, (1998)). The exact conditions which determine the stringency of hybridization depend 
not only on ionic strength (e.g., 0.2X SSC, 0.1X SSC), temperature (e.g., room temperature, 
42°C, 68°C) and the concentration of destabilizing agents such as formamide or denaturing 
agents such as SDS, but also on factors such as the length of the nucleic acid sequence, base 
composition, percent mismatch between hybridizing sequences and the frequency of occurrence 
of subsets of that sequence within other non-identical sequences. SSC buffer is 150 mM sodium 
chloride and 15 mM sodium citrate, pH 7.0. High, moderate or low stringency conditions can be 
determined empirically. 

[0028] By varying stringency conditions for hybridization from a level of stringency at 
which no hybridization occurs to a level at which hybridization is first observed, conditions 
which will allow a given sequence to hybridize (e.g., selectively) with the most similar 
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sequences in the sample can be determined. 

[0029] Exemplary conditions are described in Krause, M.H. and S.A. Aaronson, Methods 
in Enzymology, 200:546-556 (1991). Also, in, Ausubel, et aL, "Current Protocols in Molecular 
Biology John Wiley & Sons, (1998), which describes the determination of washing conditions 
for moderate or low stringency conditions. Washing is the step in which conditions are usually 
set so as to determine a minimum level of complementarity of the hybrids. Starting from the 
lowest temperature at which only homologous hybridization occurs, each °C by which the final 
wash temperature is reduced (holding SSC concentration constant) allows an increase by about 
1% in the maximum extent of mismatching among the sequences that hybridize. Generally, 
doubling the concentration of SSC results in an increase in T m of about 17°C. Using these 
guidelines, the washing temperature can be determined empirically for high, moderate or low 
stringency, depending on the level of mismatch sought. 

[0030] For example, a low stringency wash can comprise washing in a solution 
containing 0.2X SSC/0.1% sodium dodecyl sulfate (SDS) for 10 minutes at room temperature; a 
moderate stringency wash can comprise washing in a prewarmed solution (42°C) solution 
containing 0.2X SSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash can 
comprise washing in prewarmed (68°C) solution containing 0.1X SSC/0.1% SDS for 15 minutes 
at 68°C. Furthermore, washes can be performed repeatedly or sequentially to obtain a desired 
result as known in the art. Equivalent conditions can be determined by varying one or more of 
the parameters given as an example, as known in the art, while maintaining a similar degree of 
identity or similarity between the target nucleic acid molecule and the primer or probe used. 

[0031] A "test compound" as defined herein refers to a chemical, nucleic acid, 
polypeptide, amino acid, or other compound which is to be tested. Examples of test compounds 
include, but are not limited to, drug candidates, such as derived from arrays of small molecules 
generated through general combinatorial chemistry, as well as any other substances thought to 
have potential biological activity. 

DESCRIPTION OF THE ASSAYS 

[0032] Provided herein is a method for determining whether a test compound inhibits 
RNA synthesis of a positive strand RNA virus. The method comprises: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
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viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

detecting the newly synthesized RNA population comprising the labeled nucleotide 

analog; 

quantitating the newly synthesized RNA population comprising the labeled nucleotide 
analog to provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of a newly synthesized 
RNA population comprising the labeled nucleotide analog produced in the absence of the test 
compound, wherein a decrease in the test RNA amount compared to the control RNA amount 
indicates that the test compound inhibits RNA synthesis of the positive strand RNA virus. 

[0033] Further provided herein is a method for quantitating newly initiated RNA of a 
positive strand RNA virus comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, and a labeled nucleotide analog, 
under conditions sufficient for in vitro RNA synthesis, to form a newly synthesized RNA 
population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
to at least a portion of a transcription initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; and 

quantitating the protected RNA population comprising the labeled nucleotide analog. 

[0034] Also provided is a method for determining whether a test compound is an RNA 
synthesis initiation inhibitor of a positive strand RNA virus comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
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to at least a portion of an initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; 

quantitating the protected RNA population comprising the labeled nucleotide analog to 
provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of protected RNA 
comprising the labeled nucleotide analog but produced in the absence of the test compound, 
wherein a decrease in the test RNA amount compared to the control RNA amount indicates that 
the test compound inhibits RNA synthesis initiation of the positive strand RNA virus. 

[0035] In some embodiments of the above methods, the isolated viral replicase 
complexes comprising a viral replicon template RNA may be provided by transfecting a cell line 
with an isolated DNA template for a viral replicon or an isolated viral RNA, to provide a 
transfected cell line, incubating the transfected cell line under conditions suitable for viral 
replication, and isolating the replicase complexes comprising viral replicon template RNA from 
the cell membrane fraction of the transfected cell line. 

[0036] In other embodiments of the above methods, the isolated viral replicase 
complexes comprising viral replicon template RNAs may be provided by acutely infected or 
persistent infected primary hepatocytes, lymphocytes or other cell lines, incubating the infected 
cell line under conditions suitable for viral replication, and isolating the replicase complexes 
comprising viral replicon RNA from the cell membrane fraction. The replication complexes may 
be isolated from infected primary cells or cell lines. 

[0037] In certain embodiments of the methods described herein, the positive strand RNA 
virus is Hepatitis C Virus, and a suitable DNA template for the HCV viral replicon is, for 
example, SEQ ID NO: 1, which can also be described as GenBank Accession No. AJ242652. 
Other suitable replicon template DNAs include, for example, AB1 14136, AJ242654, AJ242653, 
and AJ242651 (SEQ ID NOs. 2-5). 

[0038] In certain embodiments of the methods, the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody, an analog which can be recognized via a 
high specificity binding reaction, or an analog directly detectable as a result of a physical 
property of the analog. In some embodiments, the labeled nucleotide analog is an analog capable 
of being recognized by a specific antibody, such as 5'-bromouridine 5' -triphosphate (Br-UTP). 
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In other embodiments, the labeled nucleotide analog is a radioactively labeled nucleotide. 

[0039] In the methods described herein, detection of a newly synthesized RNA 
population may be accomplished by contacting a newly synthesized RNA population with an 
antibody specific for the labeled nucleotide analog, and immuno-precipitating the newly 
synthesized RNA population comprising the labeled nucleotide analog to form an immuno- 
precipitated RNA population. In some embodiments, the nucleotide analog is Br-UTP, and the 
specific antibody is an anti-BrdU antibody. The methods described herein include quantitating 
the newly synthesized RNA population by performing real time PCR on the immuno-precipitated 
RNA population. In other embodiments, detection of a newly synthesized RNA population may 
be accomplished by detecting a radioactively labeled nucleotide by, for example, 
autoradiography. 

[0040] In some embodiments, a 2'-0-methylated nucleotide may be employed during the 
replication reaction to increase the yield of newly synthesized RNA. 

[0041] In order to form a newly synthesized RNA population, an isolated replicase 
complex comprising a viral replicase RNA template is employed. The isolated viral replicon 
template RNA comprises a transcription initiation region. The template RNA may be, for 
example, isolated as a component of the replicase complex. In some embodiments, an isolated 
viral template RNA may be added to the replicase complex. 

[0042] A viral replicon RNA or a DNA vector encoding the viral replicon RNA may be 
transfected into a cell line suitable for expression of the viral replicon RNA. Suitable cell lines 
include, for example, mammalian cell lines such as Vero cells, HeLa cells, CHO cells, COS 
cells, WI38 cells, N1H-3T3 cells (and other fibroblast cells, such as MRC-5 cells), MDCK cells, 
KB cells, SW-13 cells, MCF7 cells, BHK cells, HEK-293 cells, HepG2 cells, Bowes melanoma 
cells cell lines; and chicken embryonic fibroblast (CEF) cell lines. Preferred cell lines include 
human cell lines, such as, for example, human hepatosoma cell lines such as Huh-7. Suitable 
means of transfection include, for example, calcium phosphate co-precipitates, mechanical 
procedures such as microinjection, electroporation, insertion of a plasmid encased in liposomes, 
as well as other techniques known in the art. 

[0043] A suitable DNA vector for production of an HCV replicon RNA, for example, 
comprises, 5' -3', the HCV-IRES, the neomycin phosphotransferase (neo) gene, the IRES of 
encephalomyocarditis virus, which directs translation of HCV sequences NS3 to NS5B, and the 
3 5 -NTR. The sequence of a DNA vector suitable for production of the HCV replicon RNA has 
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been deposited in GenBank (Accession no. AJ242652) (SEQ ED NO:l). The HCV replicon 
DNA vector may be transfected into Huh-7 cells by electroporation. HCV replicon RNA may be 
transcribed in vitro using either SP6 RNA polymerase or T7 RNA polymerase from a plasmid 
DNA comprising the appropriate promoter with the cDNA encoding the replicon RNA 
downstream of the promoter. The DNA template for production of a viral replicase RNA may be 
produced by PCR amplification using a DNA template encoding the viral replicon and specific 
primers one of which comprises an SP6 or T7 RNA polymerase promoter. Alternatively, the 
DNA template for production of a viral replicase RNA may be produced by reverse transcription 
of the viral replicase RNA using primers one of which comprises an SP6 or T7 RNA polymerase 
promoter. 

[0044] Once the viral replicon RNA or DNA vector encoding the viral replicon RNA is 
transfected into a cell, the cell culture may be grown to confluency. The cells in the culture 
produce replicase complexes comprising viral replicon template RNAs, which can then be 
isolated from the cells. Viral replicon RNA includes minus strand viral replicon template RNA, 
and optionally positive strand RNA. 

[0045] Alternatively, infected cells such as acutely infected or persistent infected primary 
hepatocytes, lymphocytes or other cell lines. Infected primary cells or cell lines may be 
employed. The infected cell line is incubated under conditions suitable for viral replication, and 
the replicase complexes comprising viral replicon RNA are isolated from the cell membrane 
fraction. 

[0046] In order to isolate the replication complexes comprising viral replicon template 
RNA, the cells (i.e., transfected or infected) may be lysed and centrifuged at low speed (e.g., 
about 900 X to about 1000 X g) to remove cellular debris. The supernatant may then be 
centrifuged at higher speed (e.g., about 15000 X g) to obtain the membrane fraction, which 
comprises isolated viral replicon complexes comprising viral replicon template RNA. It has 
been shown that replication of the HCV RNA occurs in a membrane-bound replication complex. 
Thus, HCV replicon complexes and replicon template RNA can be isolated by isolating the 
membrane fraction of the cells expressing the HCV replicon RNA. 

[0047] The isolated membrane fraction containing replicase complexes comprising viral 
replicon template RNAs (e.g., an aliquot of the membrane fraction of a cell line expressing a 
viral RNA replicon) is used to perform RNA replication in vitro in a cell-free system. The 
isolated replicase complexes comprising viral replicon template RNA are contacted with a 
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labeled nucleotide analog under conditions sufficient to promote in vitro RNA synthesis. 
Conditions sufficient to promote in vitro RNA synthesis include suitable buffers, and nucleoside 
triphosphates (i.e., ATP, UTP, CTP, and GTP). The labeled nucleotide analog is incorporated 
into a newly synthesized RNA population, and provides a means for detecting only the newly 
synthesized RNA, and not the template RNA. In one embodiment, suitable conditions for 
replication include, for example, incubation in 50 /xl (total volume) of 50 mM HEPES (pH 7.5), 
10 mM KC1, 10 mM MgCl 2 , 50 units RNasin, 10 /xg/ ml actinomycin D, 2.5 mM ATP, 0.5 mM 
each CTP, GTP, labeled UTP and an aliquot of a membrane fraction of a cell line expressing the 
viral replicon RNA, e.g., the HCV replicon. In another embodiment, suitable replication 
conditions include 50 mM HEPES (pH 7.3); 10 mM KC1; 10 mM MgCl 2 ; 0.3 mM MnCl 2 ; 20 
Units of RNAse inhibitor; 10 /xg of actinimycin D per mL; 0.5 mM ATP,GTP, and UTP; 10 /xCi 
of [oc-P 32 ] CTP; and 6 /xl of the membrane fraction of a cell line expressing the viral replicon 
RNA in a total volume of 60 /xl. The replication reaction is continued for a time sufficient to 
produce a desired amount of newly synthesized RNA. 

[0048] In some embodiments, the replication conditions may include adding a 2'-0- 
methylated nucleotide such as, for example, 2'-0-methyl-5-methyluridine-5'- triphosphate to the 
replication mixture. The inclusion of a 2'-0-methylated nucleotide such as 2'-0-methyl-5- 
methyluridine-5 'triphosphate may increase the efficiency of the replication complex, resulting in 
an increase in the amount of labeled product obtained in the absence of the 2'-0-methyl-5- 
methyluridine-5'- triphosphate. Without being held to a particular theory, it is believed that a 2'- 
O-methylated nucleotide, particularly 2'-0-methyl-5-methyluridine-5'- triphosphate increases 
the efficiency of the replicase complex when a radiolabeled nucleotide such as, for example, P 32 - 
CTP is added to the reaction at low concentrations. 

[0049] Suitable labeled nucleotide analogs include, for example, analogs which are 
capable of being recognized by specific antibodies (e.g., Br-UTP), analogs such as a biotin 
labeled nucleotide (e.g., biotin-CTP) which can be recognized via a high specificity binding 
reaction (i.e., biotin/avidin or biotin/streptavidin binding), and analogs which are directly 
detectable as a result of a physical property of the analog, such as radioactivity (e.g., a re- 
labeled nucleotide), fluorescence, luminescence etc., for example a fluorescein-nucleotide (e.g., 
fluorescein-UTP). In one embodiment, the labeled nucleotide analog is 5'-bromouridine 5'- 
triphosphate (Br-UTP). The RNA containing Br-UTP may be isolated by immuno-precipitation 
and/or detection with an anti-BrdU antibody. The anti-BrdU antibody may optionally be labeled 
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with a tag suitable for direct detection of the antibody such as, for example, fluorescein, or other 
dyes such as those available from Molecular Probes. In other cases, the antibody may be 
detected via binding to a labeled secondary antibody such as, for example, a rabbit or mouse IgG 
labeled with alkaline phosphatase or horseradish peroxidase. 

[0050] The labeled nucleotide analog is added to the replication mixture so that the 
labeled nucleotide analog is incorporated into the newly synthesized RNA (e.g., newly 
synthesized viral replicon RNA) in place of at least a portion of the corresponding unlabeled 
nucleotide. For example, Br-UTP may be used in place of at least a portion of the UTP. In 
certain cases, all of the corresponding nucleotide may be replaced with the labeled nucleotide 
analog. After replication, there are two populations of RNA (e.g., viral replicon RNA): an 
unlabeled RNA population and a labeled RNA population. The newly synthesized RNA 
population, which contains the labeled nucleotide analog, can be distinguished from the RNA 
population which was present before the replication step, because the RNA population which 
was present before replication will not contain the labeled nucleotide analog. Because the newly 
synthesized RNA can be distinguished from the RNA present prior to replication, the amount of 
newly synthesized RNA (e.g., viral replicon RNA) can be detected and/or quantified by 
employing a means of RNA detection and/or quantitation. The means for RNA detection and 
quantitation may be the same or different. 

[0051] After replication, the RNA in the replication mixture optionally may be purified 
using, for example, a commercially available RNA purification kit. The newly synthesized RNA 
is then detected using a means for detecting the RNA. Detection preferably comprises 
quantitative detection. The method for detecting the RNA is selected based on the labeled 
nucleotide analog employed. For example, in the case of an analog capable of being recognized 
by specific antibodies, a specific antibody may be employed to immuno-precipitate the labeled 
RNA. The immuno-precipitated RNA may be detected, for example, by directly detecting a 
fluorescent tag present on the antibody. Alternatively, the immuno-precipitated RNA may be 
detected with a labeled secondary antibody. In yet another alternative, the immuno-precipitated 
labeled RNA can be amplified and detected using real-time PCR. 

[0052] For example, a newly synthesized RNA population containing Br-UTP as the 
labeled nucleotide analog may be immuno-precipitated using an anti-BrdU monoclonal antibody. 
The unlabeled, template RNA will not be precipitated, while the newly synthesized Br-UTP- 
labeled RNA will be specifically precipitated. The immuno-precipitated, labeled RNA may then 
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be quantified using real time PCR. Thus, in the case of, for example, a Br-UTP labeled viral 
replicon RNA precipitated with anti-BrdU antibody, the method for detection comprises 
contacting the population of viral replicon RNAs with an anti-BrdU antibody, precipitating the 
viral replicon RNAs, and detecting and quantitating the labeled viral replicon RNAs using a 
suitable method such as, for example, real time PCR. 

[0053] Real time PCR (polymerase chain reaction) is a quantitative reverse transcription- 
PCR reaction (RT-PCR). Early in RT-PCR, reagents are in excess, template and product are at 
low enough concentrations that product renaturation does not compete with primer binding, and 
amplification proceeds at a substantially constant, exponential rate. At some variable point 
during the reaction, the reaction rate ceases to be exponential, and enters a linear phase of 
amplification. Late in the amplification cycle, little product is made. Real time PCR allows the 
collection of data during the exponential phase of amplification, and thus allows for accurate 
quantitation of the amount of amplified product. Detection and quantitation of the double- 
stranded DNA produced in the PCR reaction is done using a fluorescent reporter, the signal of 
which increases in direct proportion to the amount of double-stranded DNA PCR product in a 
reaction. Taqman® probes labeled with fluorescent dye such as FAM (6-carboxy- fluorescein) 
may be used. Another suitable reporter is the double-strand DNA-specific dye SYBR® Green 
(Molecular Probes). SYBR® Green binds double-stranded DNA, and upon excitation emits 
light. Thus, as a PCR product accumulates, fluorescence increases. Other alternatives reporters 
include hybridization probes which use fluorescence energy transfer (FRET) for detection. 

[0054] When the labeled nucleotide analog is a radioactive nucleotide such as P 32 -CTP, 
the labeled RNAs may be detected by autoradiography, direct radioactive detection such as 
Phosphorimaging, and the like. 

[0055] In the case of a labeled nucleotide analog which can be recognized via a high 
specificity binding reaction, the labeled RNAs may be isolated using the high specificity binding 
reaction. For example, a biotin-labeled nucleotide can be recognized by avadin or streptavadin. 
Once the labeled RNAs have been isolated, the labeled RNAs can be amplified and detected 
using real-time PCR. In the case of a directly detectable nucleotide analog, the labeled 
nucleotide analog can be directly detected using fluorescence, luminescence, and the like. 

[0056] Suitable controls for quantitation of the newly synthesized RNA population 
follow. Template RNAs isolated from cells untransfected with a DNA vector or viral RNA and 
subjected to replication, have no detectable amount of newly synthesized RNA. Template RNAs 
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isolated from cells transfected with a template DNA vector or viral RNA, but replicated without 
the labeled nucleotide analog, have only background levels of newly synthesized RNA. 
Template RNAs from cells transfected with template DNA vector or viral RNA, replicated in the 
presence of a labeled nucleotide analog, which are not detected with the means for detection, 
show only background levels of newly synthesized RNA. Template RNAs from cells transfected 
with template DNA vector or viral RNA, replicated in the presence of labeled nucleotide analog 
and detected with the means for detection should, however, show a strong newly synthesized 
RNA signal in this assay. 

[0057] The assay may be employed to quantitate a newly initiated RNA population. In 
the case of HCV, for example, most (e.g., about 90%) of the newly synthesized viral RNAs are 
the elongation products of previously initiated (i.e., initiated) template RNAs. In an initiated 
template RNA, a portion of the newly synthesized RNA is previously made using a minus strand 
template. Upon addition of the appropriate factors (e.g., buffers and nucleoside triphosphates), 
elongation of the initiated RNA can proceed to form the full-length RNA. Only a small fraction 
(e.g., less than about 10%) of the newly synthesized RNAs represent newly initiated products 
formed from uninitiated template RNAs (i.e., pre-initiation). Both the previously initiated and 
newly initiated RNAs will comprise the labeled nucleotide analog in the elongation region of the 
RNA. Only the newly initiated RNAs, however, will comprise the labeled nucleotide analog in 
the transcription initiation region. A method to select and quantitate newly initiated RNAs 
comprises employing RNA protection to the newly synthesized RNA population which may 
comprise previously initiated RNA population as well as newly initiated RNAs. 

[0058] In the RNAse protection assay, a nucleic acid probe is added to the newly 
synthesized viral replicon RNA population which may comprise previously initiated RNAs as 
well as newly initiated RNAs. The nucleic acid probe comprises a region complementary at least 
a portion of the transcription initiation region of the viral replicon RNAs (i.e., thenewly 
synthesized RNA population). When the probe is hybridized to the population of newly 
synthesized viral replicon RNAs, a portion of the transcription initiation region becomes double- 
stranded, with the remainder of the RNA being single-stranded. The single-stranded RNA may 
then be removed, for example, by digestion with a single-strand specific ribonuclease specific for 
single-stranded RNA. Suitable single-strand specific ribonucleases include, for example, 
ribonuclease Tl, ribonuclease A, nuclease SI, and combinations comprising one or more of the 
foregoing single-strand specific ribonucleases. After digestion, the remaining RNA will thus be 
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a double-stranded RNA (i.e., protected) population in which the previously initiated RNAs will 
comprise no labeled nucleotide analog, and the newly initiated RNAs comprise the labeled 
nucleotide analog. The protected newly initiated RNAs can then be detected and/or quantified as 
described above. 

[0059] Hybridization of the probe may be performed under conditions which are 
sufficient to allow specific hybridization of the nucleic acid probe to the viral replicase RNA. 
Specific hybridization can be performed under high stringency conditions or moderate stringency 
conditions, for example. 

[0060] In a particularly preferred embodiment, the hybridization conditions for specific 
hybridization are high stringency. 

[0061] The assay described herein may be employed to screen test compounds for 
inhibition of viral replicon RNA synthesis. Such compounds include replicase complex activity 
inhibitors as well as RNA synthesis initiation and elongation inhibitors. The method may be 
employed to distinguish initiation inhibitors from elongation inhibitors. If a viral replicon RNA 
synthesis inhibitor is added to a replication mixture, less viral replicon RNA should be 
synthesized, and the amount of labeled viral replicon RNA detected will be decreased (e.g., test 
RNA amount) relative to the amount of viral replicon RNA in a control sample (i.e., control 
RNA amount) with no added inhibitor. The amount of labeled viral replicon RNA detected will 
be decreased, for example, if the viral replicon RNA synthesis inhibitor is a viral replicon RNA 
elongation inhibitor. Viral replicon RNA elongation inhibitors will block synthesis of newly 
initiated viral replicon RNA elongation products and also previously initiated viral replicon RNA 
elongation products. If, however, the viral replicon RNA synthesis inhibitor is an initiation 
inhibitor, signal strength will be only modestly affected. Most (e.g., approximately 90%) of the 
newly synthesized viral RNAs of HCV, for example, are the elongation products of previously 
initiated viral replicon RNAs. Thus, only a small fraction (e.g., less than about 10%) of the 
newly synthesized replicon RNAs will be affected by an initiation inhibitor. FIGURE 1 provides 
a schematic of the inhibition of HCV RNA synthesis by initiation and elongation inhibitors. 

[0062] An RNAse protection assay, as described herein, can be added after formation of 
the newly synthesized RNA population to identify initiation inhibitors. In this assay, a nucleic 
acid probe which is complementary to at least a portion of the initiation region of the newly 
synthesized viral replicon is employed. The probe is hybridized to the viral replicon RNA such 
that the region of the viral replicon RNA complementary to the probe is double-stranded (i.e., 
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replicated), and the remainder of the viral replicon RNA is single-stranded. Hybridization is 
preferably performed under stringent conditions. The single-stranded portion of the viral 
replicon RNA is then digested with a ribonuclease specific for single-stranded RNA such as, for 
example, ribonuclease A, ribonuclease Tl, nuclease SI, or a combination comprising one or 
more of the foregoing ribonucleases. The remaining population of double-stranded replicon 
RNA will then contain two fractions: a fraction of previously initiated viral replicon RNAs 
which comprise no labeled nucleotide analog, and a fraction of viral replicon RNAs initiated 
after addition of the labeled nucleotide analog which comprise the labeled nucleotide analog. 
The protected viral replicon RNAs containing the labeled nucleotide analog can then be 
specifically detected and/or quantified as described above. 

[0063] In the case of HCV, an exemplary probe is an RNA complimentary to nucleotides 
15 to 434 of the HCV replicon RNA (SEQ ID NO:6), which hybridizes with the newly 
synthesized viral replicase RNA near the initiation codon. The RNA not hybridized to the probe 
is digested with a single- stranded ribonuclease, leaving only nucleotides 15 to 434 of the 
replicon RNA. The digested RNA will have two double-stranded RNA populations: a 
population of previously initiated replicon RNAs which do not contain the labeled nucleotide 
analog, and a population of newly initiated replicon RNAs which contain the labeled nucleotide 
analog. Thus, previously initiated HCV RNA elongation products will not be detected, while 
newly synthesized RNAs will be detected. In this assay including RNA protection, when an 
initiation inhibitor is added to the replication mixture, a reduced amount of labeled nucleotide 
analog is incorporated at the initiation site, and the signal measured by the means for detection 
will be decrease decreased. 

[0064] In positive strand RNA viruses, the replicase complex is a multi-protein complex 
which replicates viral replicon RNA from a viral replicon template RNA. Replicase complexes 
which include viral replicon template RNAs in addition to replication proteins can be isolated as 
a membrane bound fraction from cells expressing the viral replicon RNA. The isolated replicon 
complexes include both previously initiated and uninitiated viral replicon template RNAs. Upon 
addition of nucleotide triphosphates and other components sufficient for RNA synthesis, 
elongation of previously initiated template RNAs proceeds to produce full-length viral replicase 
RNAs. It had not, however, been demonstrated that de novo initiation of uninitiated viral 
replicon template RNAs occurs in isolated replicase complexes. By using an inhibitor of 
initiation of viral replicon RNA synthesis initiation, it can be shown that de novo initiation of 
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viral replicase RNA synthesis occurs in isolated replicase complexes. Because de novo initiation 
of RNA synthesis takes place in the isolated replicase complexes, the methods may be used to 
quantify newly initiated RNA and also to identify inhibitors of RNA synthesis initiation. 

[0065] Thus, the described methods are suitable for detecting inhibitors of RNA 
synthesis of positive strand RNA viruses such as RNA initiation and elongation inhibitors. 

[0066] Also provided herein is a kit for screening a test compound for inhibition of RNA 
synthesis of a positive strand RNA virus. The kit comprises a replicase complex for the positive 
strand RNA virus comprising a viral replicon template RNA for the positive strand RNA virus, 
instructions for use of the kit, and buffers and nucleoside triphosphates which are sufficient for 
synthesis of the viral replicon RNA. Instructions may include, for example, instructions for 
quantitating a newly synthesized RNA population, instructions for determining if a compound is 
an RNA synthesis inhibitor, and the like. The instructions may be written instructions, although 
electronic storage media (e.g., magnetic diskette or optical disk) containing instructions are also 
acceptable, relating to the use of components of the methods. The instructions included with the 
kit may include information as to reagents (e.g., whether included or not in the kit) necessary for 
practicing the methods, instructions on how to use the kit, and/or appropriate reaction conditions. 

[0067] Suitable buffers include those described previously as suitable for performing 
replication with isolated replication complexes. 

[0068] The kits may optionally further comprise, for example, a labeled nucleotide 
analog, a means for detecting and/or quantitating a labeled nucleotide analog, a primer, a single- 
strand specific ribonuclease, and instructions for performing an RNAse protection assay. 

[0069] The component(s) of the kit may be packaged in a convenient, appropriate 
packaging. The components may be packaged separately, or in one or multiple combinations. 

[0070] The invention is further illustrated by the following nonlimiting examples. 

EXAMPLES 

Example 1 . Growth and Maintenance of HCV Replicon Containing Cells 1A. HCV 
Replicon Expression and Cell Maintenance 

[0071] RNA molecules encoding the HCV replicon (i.e., viral replicon RNAs) are 
transfected into Huh-7 cells using electroporation. 

[0072] The equipment and materials for cell maintenance include, but are not limited to, 
Huh-7 HCV replicon-containing cells, maintenance media (DMEM (Dulbecco's modified Eagle 
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media) supplemented with 10% FBS, L-glutamine, non-essential amino acids, penicillin (100 
units/ml), streptomycin (100 micrograms/ml), and 500 micrograms/ml of Geneticin (G418), 
screening media (DMEM supplemented with 10% FBS, L-glutamine, and non-essential amino 
acid, penicillin (100 units/ml) and streptomycin (100 micrograms/ml)), 96 well tissue culture 
plates (flat bottom), 96 well plates (U bottom for drug dilution), Interferon alpha for positive 
control, fixation reagent (such as methanol: acetone), primary antibody (rabbit anti-NPTII), 
secondary antibody: Eu-Nl 1, and enhancement solution. 

[0073] Cells containing the HCV replicon support high levels of viral RNA replication 
when their density is suitable. Over-confluency may cause decreased viral RNA replication. 
Therefore, cells should be grown in log phase in the presence of 500 micrograms/ml of G418. 
Generally, cells should be passed twice a week at 1 : 4-6 dilution. Cell maintenance is conducted 
as follows: 

[0074] Cells containing the HCV replicon DNA vector are examined under a microscope 
to ensure that the cells are growing well. Cells are rinsed once with phosphate buffered saline 
(PBS) and 2 ml trypsin is added. The cell/ trypsin mixture is incubated at 37°C in a CO2 
incubator for 3-5 minutes. After incubation 10 ml of complete media is added to stop the 
trypsinization reaction. Cells are blown gently, put into a 15 ml tube, and spun at 1200 rpm for 4 
minutes. The trypsin/ medium solution is removed and the pelleted cells are recovered. 

Example 2. Quantitative assay for detection of newly synthesized HCV RNA in cell- 
free SYSTEM 

[0075], Membrane Fractions are purified from Huh-7 cells transfected with the HCV 
replicon RNA according to the procedure given by Hardy, et al. {J. Virol (2003) 77:2029-2037). 
Briefly, the HCV replicon-containing cells are washed with IX PBS, re-suspended in cold 
hypotonic buffer (lOmM Tri-HCl, pH 7.8, 10 mM NaCl), and put on ice for 20 minutes. The 
swelled cells are disrupted using a dounce homogenizer. The mix is centriflxged at 900 X g for 5 
minutes at 4°C. The supernatant is transferred to a fresh tube and centrifiiged at 15000 X g for 
25 minutes at 4°C. The pellet, which contains the membrane fraction, is re-suspended in storage 
buffer (hypotonic buffer with 15% glycerol), and may be stored at -80°C. 

[0076] In vitro RNA replication is performed according to the procedure of Lai Virol 
(2003) 77:2295-2300) with some modifications. Test compound (1 /xM to 100 /iM in 0.5 jd 
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DMSO) is pre-incubated the membrane faction (10 /xl) for 10 minutes at 30°C before the 
replication mix is added. The replication mix (50 /xl total volume) contains 50 mM HEPES (pH 
7.5), 10 mM KC1, 10 mM MgCl 2 50 U RNAin, 10 fig/ ml actinomycin D, 2.5 mM ATP, 0.5 mM 
each CTP, GTP, Br-UTP and the pre-incubated membrane fraction (i.e., the replicase complex 
and viral replicon template RNA). The replication mix is incubated at 30°C for 2 hours, 
followed by purification using TRIZOL (Invitrogen, Carlsbad, CA) or RNA EASY (Qiagen, 
Valencia, CA) according to manufacturer's instructions. The RNA samples are then immuno- 
precipitated using anti-BrdU monoclonal antibody (Molecular Probes, Eugene, OR) and protein 
A agarose (Invitrogen) in 200 [il to 400 /xl of precipitation buffer containing IX PBS, 0.05% NP 
40, 0.1 /xg/ /xl tRNA, 5 U/ fi\ RNasin, and 0.3 M uridine. 

[0077] When the newly synthesized segment near the initiation site is of interest, an RNA 
protection assay is used after Br-UTP labeling and RNA purification. The RNA probe for 
segment from 15 to 434 nt (SEQ ID NO: 6) is synthesized using an in vitro transcription kit 
(MAXIscript, Ambion). The DNA template for in vitro transcription was made by PCR using 
forward primer 5' GGGGGCGACACTCCACCATAGAT (15-37) (SEQ ID NO: 7) and reverse 
primer 5 f ATTTAGGTGACACTATAGAAACCCAAGCGGCCGGAGAACCT (413-434, plus 
SP6 core sequence) (SEQ ID NO: 8) on DNA vector for replicon HCV RNA or cDNA made 
from replicon HCV RNA by reverse transcription. The RNA is protected using an RNA protect 
assay kit (e.g. RPA III, Ambion, Austin, TX). Briefly, in the RNA protection assay, the newly 
synthesized replicon RNAs are hybridized to the RNA probe. The hybridized RNAs are then 
treated with a mixture of ribonuclease A and ribonuclease Tl to digest the single-stranded RNA. 
The remaining double-stranded RNA may then be precipitated prior to detecting. 

[0078] The newly synthesized HCV RNA produced in the cell-free system is 
quantitatively detected using real time PCR. Table 1 shows the data for several control 
experiments. A Taqman® probe labeled with fluorescent dye FAM was used. 
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TABLE 1 


Sample 


Copy number after PCR with 
no immuno-precipitation with 
anti-BrdUTP antibody 


Copy number after PCR 
performed after immuno- 
precipitation with anti- 
Brdu lr antibody 


Template RNA isolated from 
Huh-7 cells expressing HCV 
replicon with Br-UTP 


1.86X10' 


2.05 X 10 s 


Template RNA isolated from 
Huh-7 cells expressing HCV 
replicon with Br-UTP: No 
anti-r>ru-u ir antibody 


1.85 XI 0' 


2401 


Membranes isolated from 
Huh-7 cells with no HCV (no 
HCV template RNA) 


15 


3 


Template RNA isolated from 
Huh-7 cells expressing HCV 
replicon: No Br-UTP in 
replication 


1.79 X 10' 


7270 



[0079] As seen from Table 1, for template RNA isolated from Huh-7 cells expressing the 
HCV replicon with Br-UTP in the replication mixture, a high copy number of replicon RNA is 
observed both with and without precipitation with an anti-BrdUTP antibody. If no anti-BrdUTP 
antibody is used, the sample without precipitation has a high copy number of HCV replicon 
RNA, while the precipitated population has only a background amount. If the Huh-7 cells are 
not transfected with the HCV replicon (i.e., no template RNA), a background amount of replicon 
RNA is observed with or without precipitation with an anti-BrdU antibody. If there is no Br- 
UTP in the replication mixture with template RNA isolated from Huh-7 cells expressing the 
HCV replicon, the sample without precipitation has a high copy number of HCV replicon RNA, 
while the precipitated population has only a background amount. Thus, the control experiments 
give the expected results. 

[0080] As shown in Table 2, the copy number of the HCV RNA measured by 
quantitative PCR is dependent upon the amount of template RNA added to the replication 
mixture. In Table 2, HCV is a replication mixture with template RNA isolated from Huh-7 cells 
expressing the HCV replicon. HVC (1/2) and HCV (1/8) are replication mixtures containing X A 
and 1/8 of the amount of template RNA, respectively. Huh-7 (no HCV) is a no HCV template 
control as described above. No Br-UTP is a control in which template RNA isolated from Huh-7 
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cells expressing the HCV replicon, but no Br-UTP is added to the replication mixture. 



TABLE 2 


odinpic 


Pnn^? mimV\/=*r oftf*T T^f" 1 !? \X7lttl 

K^upy numDcr aiici .ri^iv wiui 
no immuno-precipitation with 
anti-BrdUTP antibody 


v^upy iiuiiiuci ctxicx i v_/X\. 

performed after immuno- 
precipitation with anti- 
BrdUTP antibody 


HCV 


2.19 X 10 7 


7.34 X 10 s 


HCV (1/2) 


1 X 10 7 


3.7 X 10 5 


HCV (1/8) 


3.1 X 10 b 


1.51 X 10 s 


Huh-7 (No HCV) 


19 


2030 


No Br-UTP 


2.17 X 10 7 


4.6 X 10 4 



[0081] As shown in Table 2, the copy number of HCV replicons measured in the assay, 
both with and without precipitation with an anti-BrdUTP antibody, is directly proportional to the 
amount of template replicon RNA added to the reaction mixture. 

[0082] Table 3 shows the data for a known HCV RNA replication inhibitor that is tested 
using the disclosed assay at a concentration of 10 (iM. ACH-0134152 is a published compound 
(Dhanak et al., Biol. Chem. (2002) 41):38322-7. The structure of ACH-0134152 is shown 
below. 



4 




TABLE 3 


Sample 


Copy number after PCR with 
no RNAse protection assay 


Copy number after PCR with 
RNAse protection assay 






Copies as % 
of samples 
with no test 
compound 




Copies as % 
of samples 
with no test 
compound 


No test compound 


46942 


100 


137494 


100 


ACH-0134152 


31156 


66 


16970 


12 
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[0083] In Table 3, although treatment with ACH-0134152 results in less than half 
reduction in total viral RNA synthesis with no RNAse protection, it causes close to a 10-fold 
reduction in the amount of newly initiated viral RNA synthesis after performing RNAse 
protection. Thus, compound ACH-0134152 inhibits HCV RNA synthesis at the initiation stage. 

[0084] The results reported in this example validate this assay for compound screening 
and profiling. The assay has the advantage of quantitatively detecting newly synthesized HCV 
RNA, and permitting but also specific segment of our choice. 

Example 3. Use of an RNA initiation inhibitor to demonstrate de novo initiation of 

REPLICATION IN ISOLATED REPLICATION COMPLEXES 

[0085] An RNA replication assay was used to determine if the replication complexes 
isolated from a membrane fraction were enzymatically active. The ACH-0134152 initiation 
inhibitor was employed to determine if the isolated replication complexes were capable of de 
novo initiation of newly synthesized RNA. Standard replication mixtures contained 50 mM 
HEPES (pH 7.3); 10 mM KC1; 10 mM MgCl 2 ; 0.3 mM MnCl 2 ; 20 Units of RNAse inhibitor; 10 
\ig of actinimycin D per mL; 0.5 mM ATP, GTP, and UTP; 10 \xC\ of [a-P 32 ] CTP; and 6 \i\ of 
the membrane fraction in a total volume of 60 jjI The reaction mixtures were incubated at 30°C 
for 2 hours. The RNA products were extracted with phenol-chloroform, ethanol precipitated, 
and separated on a 1% agarose gel. After electrophoresis, the gel was fixed with 10% glacial 
acetic acid then in ethanol and dried prior to autoradiography. 

[0086] Figure 2 shows an autoradiogram of the experimental results. Lane 4 is a control 
lane with no added inhibitor. Lanes 1-3 show the results with initiation inhibitor concentrations 
of 100, 20, and 4 jiM, respectively. DS stands for double-stranded RNA and SS stands for 
single-stranded RNA. As clearly shown in the autoradiogram, addition of the initiation inhibitor 
reduces the amount of single-stranded RNA produced by the replication complex. In the assay, 
double-stranded RNA is produced largely from elongation of previously initiated RNAs, while 
single-stranded RNA is produced from newly initiated RNAs. The presence of a single-stranded 
RNA band in control lane 4 shows that newly initiated RNAs are formed during replication. 
When ACH-0134152, a known RNA synthesis initiation inhibitor, is added during replication, 
the single-stranded RNA band disappears, while the double-stranded RNA band remains. Thus, 
by employing an RNA synthesis initiation inhibitor, it is confirmed that de novo RNA synthesis 
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initiation occurs in isolated replicase complexes. . 

[0087] A method has been described to identify inhibitors of viral replicon RNA 
synthesis using detection and/or quantitation of a population of newly synthesized RNA. A 
decrease in the amount of newly synthesized RNA made in the presence of a test compound 
compared to a control with no test compound indicates that the test compound is a viral replicon 
RNA synthesis inhibitor. The method is particularly useful for identifying RNA elongation 
inhibitors. In addition, an RNAse protection assay may be employed on the newly synthesized 
RNA to identify RNA initiation inhibitors. It has been clearly demonstrated that isolated 
replication complexes are capable of de novo initiation in addition to elongation of previously 
initiated RNA. An advantage of the method is that initiation inhibitors can be distinguished from 
elongation inhibitors. Another advantage is that quantitation of the newly synthesized RNA 
population is employed to identify viral replicon RNA synthesis inhibitors. Another advantage is 
that a 2'-0-methylated nucleotide can be used to increase the yield of newly synthesized RNA 
produced during replication. 

[0088] From the foregoing it will be appreciated that, although specific embodiments of 
the invention have been described herein for purposes of illustration, various modifications may 
be made without deviating from the spirit and scope of the invention. 
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CLAIMS 

What is claimed is: 

1. A method for determining whether a test compound inhibits RNA synthesis of a 
positive strand RNA virus, comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

detecting the newly synthesized RNA population comprising the labeled nucleotide 

analog; 

quantitating the newly synthesized RNA population comprising the labeled nucleotide 
analog to provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of a newly synthesized 
RNA population comprising the labeled nucleotide analog produced in the absence of the test 
compound, wherein a decrease in the test RNA amount compared to the control RNA amount 
indicates that the test compound inhibits RNA synthesis of the positive strand RNA virus. 

2. The method of Claim 1, wherein contacting further comprises contacting with 2'- 
0-methyl-5-methyluridine-5'- triphosphate. 

3. The method of Claim 1, further comprising providing the isolated replicase 
complex comprising the viral replicon template RNA by: 

transfecting a cell line with a viral replicon RNA or a DNA template for a viral replicon 
to provide a transfected cell line, 

incubating the transfected cell line under conditions suitable for production of viral 
replicase complexes, and 

isolating the replicase complexes and the viral replicon template RNA from the cell 
membrane fraction of the transfected cells. 



29 



API-0005 



EXPRESS MAIL NO. EV 402807847 US 



4. The method of Claim 3, wherein the positive strand RNA virus is Hepatitis C 
Virus and the DNA template for a viral replicon is SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID 
NO:3, SEQ ID NO: 4, or SEQ ID NO: 5. 

5. The method of Claim 1, further comprising providing the isolated replicase 
complex comprising the viral replicon template RNA by 

incubating a positive strand RNA virus infected primary cell or cell line under conditions 
suitable for production of viral replicase complexes, and 

isolating the replicase complexes comprising the viral replicon template RNA from the 
cell membrane fraction of the infected cells. 

6. The method of Claim 1, wherein the positive strand RNA virus is Hepatitis C 

Virus. 

7. The method of Claim 1, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody, an analog which can be recognized via a 
high specificity binding reaction, or an analog directly detectable as a result of a physical 
property of the analog. 

8. The method of Claim 7, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody. 

9. The method of Claim 8, wherein the labeled nucleotide analog is Br-UTP. 

10. The method of Claim 8, wherein detecting comprises contacting the newly 
synthesized RNA with an antibody specific for the labeled nucleotide analog and immuno- 
precipitating the newly synthesized RNA comprising the labeled nucleotide analog to form 
immuno-precipitated RNA. 

11. The method of Claim 10, wherein the labeled nucleotide analog is Br-UTP and 
the specific antibody is an anti-BrdU antibody. 
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12. The method of Claim 10, wherein quantitating the newly synthesized RNA 
comprises performing real time PCR on the immuno-precipitated RNA. 



13. The method of Claim 1, wherein the test compound is an RNA elongation 
inhibitor. 

14. The method of Claim 1, wherein the test compound is an RNA synthesis initiation 
inhibitor. 



15. The method of Claim 1 5 wherein the test compound is a replicase complex activity 
inhibitor. 



16. A method for quantitating newly initiated RNA of a positive strand RNA virus 
comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, and a labeled nucleotide analog, 
under conditions sufficient for in vitro RNA synthesis, to form a newly synthesized RNA 
population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
to at least a portion of a transcription initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; and 

quantitating the protected RNA population comprising the labeled nucleotide analog. 

17. The method of Claim 16, wherein contacting further comprises contacting with 
2'-0-methyl-5-methyluridine-5 triphosphate. 
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18. The method of Claim 16, further comprising providing the isolated replicase 
complex comprising the viral replicon template RNA by: 

transfecting a human hepatosoma cell line with a viral replicon RNA or a DNA template 
for a viral replicon to provide a transfected cell line, 

incubating the transfected cell line under conditions suitable for production of viral 
replicase complexes, and 

isolating the replicase complexes comprising the viral replicon template RNA from the cell 
membrane fraction of the transfected cells. 

19. The method of Claim 1 8, wherein the DNA template for the viral replicon is SEQ 
ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5. 

20. The method of Claim 1 6, further comprising providing the isolated replicase 
complex comprising the viral replicon template RNA by: 

incubating a positive strand RNA virus infected primary cell or cell line under conditions 
suitable for production of viral replicase complexes, and 

isolating the replicase complexes comprising the viral replicon template RNA from the 
cell membrane fraction of the infected cells. 

21 . The method of Claim 16, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody, an analog which can be recognized via a 
high specificity binding reaction, or an analog directly detectable as a result of a physical 
property of the analog. 

22. The method of Claim 21, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody. 

23. The method of Claim 22, wherein the labeled nucleotide analog is Br-UTP. 
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24. The method of Claim 22, wherein detecting comprises contacting the newly 
synthesized RNA with an antibody specific for the labeled nucleotide analog and immuno- 
precipitating the newly synthesized RNA comprising the labeled nucleotide analog to form 
immuno-precipitated RNA. 

25. The method of Claim 24, wherein the labeled nucleotide analog is Br-UTP and 
the specific antibody is an anti-BrdU antibody. 

26. The method of Claim 25, wherein quantitating the newly synthesized RNA 
comprises performing real time PCR on the immuno-precipitated RNA. 

27. A method for determining whether a test compound is an RNA synthesis initiation 
inhibitor of a positive strand RNA virus comprising: 

contacting an isolated replicase complex for the positive strand RNA virus comprising a 
viral replicon template RNA for the positive strand RNA virus, a labeled nucleotide analog, and 
the test compound, under conditions sufficient for in vitro RNA synthesis, to form a newly 
synthesized RNA population comprising the labeled nucleotide analog; 

hybridizing a probe and the newly synthesized RNA population comprising the labeled 
nucleotide analog, under stringent hybridization conditions, wherein the probe is complementary 
to at least a portion of an initiation region of the newly synthesized RNA population; 

digesting unhybridized, single-stranded RNA with a single-strand specific ribonuclease to 
form a protected RNA population; 

detecting a protected RNA population comprising the labeled nucleotide analog; 

quantitating the protected RNA population comprising the labeled nucleotide analog to 
provide a test RNA amount; and 

comparing the test RNA amount with a control RNA amount of protected RNA 
comprising the labeled nucleotide analog but produced in the absence of the test compound, 
wherein a decrease in the test RNA amount compared to the control RNA amount indicates that 
the test compound inhibits RNA synthesis initiation of the positive strand RNA virus. 
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28. The method of Claim 27, wherein contacting further comprises contacting with a 
2'-0-methyl-5-methyluridine-5'- triphosphate. 

29. The method of Claim 27, further comprising providing the isolated replicase 
complex and the isolated viral replicon template RNA by: 

transfecting a human hepatosoma cell line with a viral replicon RNA or a DNA template 
for a viral replicon to provide a transfected cell line, 

incubating the transfected cell line under conditions suitable for production of viral 
replicase complexes, and 

isolating the replicase complexes comprising the viral replicon template RNA from the 
cell membrane fraction of the transfected cells. 

30. The method of Claim 29, wherein the DNA template for a viral replicon is SEQ 
ID NO: 1, SEQ ID NO: 2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID NO: 5. 

3 1 . The method of Claim 27, further comprising providing the isolated replicase 
complex comprising the viral replicon template RNA by: 

incubating a positive strand RNA virus infected primary cell or cell line under conditions 
suitable for production of viral replicase complexes, and 

isolating the replicase complexes comprising the viral replicon template RNA from the 
cell membrane fraction of the infected cells. 

32. The method of Claim 27, wherein the positive strand RNA virus is Hepatitis C 

Virus. 

33. The method of Claim 27, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody, an analog which can be recognized via a 
high specificity binding reaction, or an analog directly detectable as a result of a physical 
property of the analog. 
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34. The method of Claim 33, wherein the labeled nucleotide analog is an analog 
capable of being recognized by a specific antibody. 

35. The method of Claim 34, wherein the labeled nucleotide analog is Br-UTP. 

36. The method of Claim 34, wherein detecting comprises contacting the newly 
synthesized RNA with an antibody specific for the labeled nucleotide analog and immuno- 
precipitating the newly synthesized RNA comprising the labeled nucleotide analog to form 
immuno-precipitated RNA. 

37. The method of Claim 36, wherein the labeled nucleotide analog is Br-UTP and 
the specific antibody is an anti-BrdU antibody. 

38. The method of Claim 37, wherein quantitating the newly synthesized RNA 
comprises performing real time PCR on the immuno-precipitated RNA. 

39. A kit for screening a test compound for inhibition of RNA synthesis of a positive 
strand RNA virus, comprising: 

a replicase complex for the positive strand RNA virus; 

a viral replicon template RNA for the positive strand RNA virus; 

instructions for use; and 

a buffer and nucleoside triphosphates sufficient for production of newly synthesized viral 
replicon RNA. 

40. The kit of Claim 39, further comprising a labeled nucleotide analog. 

41. The kit of Claim 40, further comprising a means for detecting the labeled 
nucleotide analog. 

42. The kit of Claim 40, further comprising a means for quantitating the labeled 
nucleotide analog. 
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43. The kit of Claim 39, further comprising a primer, a single-strand specific 
ribonuclease, and instructions for performing a ribonuclease protection assay. 

44. The kit of Claim 39, wherein the nucleoside triphosphates comprise 2'-0-methyl- 
5-methyluridine-5'- triphosphate. 
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QUANTITATIVE ASSAY FOR DETECTION OF NEWLY SYNTHESIZED RNA IN A 
CELL-FREE SYSTEM AND IDENTIFICATION OF RNA SYNTHESIS INHIBITORS 

ABSTRACT 

[0089] Methods for detecting populations of newly synthesized and newly initiated 
RNAs in a cell-free system are described. Preferred templates for the populations of newly 
synthesized and newly initiated RNAs are viral replicon RNAs, particularly HCV viral replicon 
RNA. The newly synthesized and newly initiated RNAs may be formed in the presence of a 
labeled nucleotide analog suitable for detection and/or quantitation of the RNAs. The assay may 
be employed to identify small molecule inhibitors of RNA synthesis of positive strand RNA 
viruses such as Hepatitis C Virus. 
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I" 1: AJ242652 . Hepatitis C virus.. .[gi:5441 834] 



Details 
Feat 



Links 



LOCUS SSE242652 7989 bp RNA linear SYN ll-MAY-2000 

DEFINITION Hepatitis C virus replicon I377/NS3 -3 ' UTR . 
ACCESSION AJ242652 

VERSION AJ242652 .1 GI: 5441834 

KEYWORDS core-neo gene; NS2 gene; NS2 proteinase; NS3 gene; NS3 

proteinase/helicase ; NS3/4A proteinase cofactor; NS4A gene; NS4B 
gene; NS4b protein; NS5a gene; NS5A phosphoprotein; NS5b gene; NS5B 
RNA dependant RNA polymerase; polyprotein. 

SOURCE Hepatitis C virus replicon I377/NS3 -3 ' UTR 

ORGANISM Hepatitis C virus replicon I377/NS3 -3 ' UTR 
artificial sequences; vectors. 

REFERENCE 1 

AUTHORS Lohmann, V. , Korner,F., Koch, J., Herian,U., Theilmann, L . and 
Bartenschlager, R . 

TITLE Replication of subgenomic hepatitis C virus RNAs in a hepatoma cell 

line 

JOURNAL Science 285 (5424), 110-113 (1999) 
MEDLINE 99322193 
PUBMED 10390360 
REFERENCE 2 (bases 1 to 7989) 
AUTHORS Bartenschlager, R. 
TITLE Direct Submission 

JOURNAL Submitted (26-MAY-1999) Bartenschlager R., Institute for Virology, 
Johannes Gutenberg - University Mainz, Obere Zahlbacher Strasse 67, 
55131 Mainz, GERMANY 
FEATURES Location/Qualifiers 
source 1 . .7989 

/organism= n Hepatitis C virus replicon I377/NS3 -3 1 UTR" 
/mol_type= "other RNA" 
/ db_xr e f = " t axon : 9 5 3 6 3 " 
/ focus 

/note= "Neomycin- selectable bicistronic subgenomic 

hepatitis C virus replicon" 
source join(l. .341, 1801. .7758) 

/organism="Hepatitis C virus" 

/mol_type= "other RNA" 

/strain="lb" 

/isolate="Conl" 

/ db_xr e f = " t axon : 1 1 1 0 3 " 

/ 1 i s sue_type= " 1 i ver " 
source 1190 . . 1800 

/organism^ "Encephalomyocarditis virus " 

/mol_type="other RNA" 

/ db_xre f = " t axon : 1 2 1 0 4 " 
5 'UTR 1. .341 

/evidence=experimental 
gene 342 . .1181 

/gene= " core-neo " 
CDS 342 . . 1181 

/gene= " core-neo " 



http://www.ncbi. nlm.nih.gov/entrez/viewer.fcgi?db=nucleotide&val=5441834 
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/function="selectable marker" 
/codon_start=sl 

/product="core-neo fusion protein" 
/protein_id=" CAB46912. 1 " 
/db_xref ="GI : 5441835" 

/translation^ "MSTNPKPQRKTKGRAMIEQDGLHAGSPAAWVERLFGYDWAQQTI 
GCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDWTEA 
GRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERAR 
TRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLWTHGDACLPNIMVENGRF 
SGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFYRLLD 
EFF" 

misc_f eature 1190. .1800 

/note=" internal ribosome entry site (IRES) " 
CDS 1801. .7758 

/codon_start=l 

/product ="non- structural polyprotein" 
/protein_id=" CAB46913 .1 " 
/db_xref ="GI : 544183 6" 

/ translat ion= "MAPITAYSQQTRGLLGCIITSLTGRDRNQVEGEVQWSTATQSF 
LATCVNGVCWTVYHGAGSKTLAGPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGS 
SDLYLVTRHADVIPVRRRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVC 
TRGVAKAVDFVPVESMETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAA 
YAAQGYKVLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTITTGAPITYSTYGKFLA 
DGGCSGGAYDIIICDECHSTDSTTILGIGTVLDQAETAGARLWLATATPPGSVTVPH 
PN I EE VALS S TGE I P F YGKAI P I ETI KGGRHL I FCHS KKKCDE L AAKL SGLGLNAVAY 
YRGLDVSVIPTSGDVIWATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETT 
TVPQDAVSRSQRRGRTGRGRMGIYRFVTPGERPSGMFDSSVLCECYDAGCAWYELTPA 
ETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTHIDAHFLSQTKQAGDNFPYLVAYQA 
TVCARAQAPPPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEVTTTHPITKYIMACM 
SADLEWTSTWVLVGGVLAALAAYCLTTGSWIVGRIILSGKPAIIPDREVLYREFDE 
MEECASHLPYIEQGMQLAEQFKQKAIGLLQTATKQAEAAAPWESKWRTLEAFWAKHM 
WNFISGIQYLAGLSTLPGNPAIASLMAFTASITSPLTTQHTLLFNILGGWVAAQLAPP 
SAASAFVGAGIAGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVN 
LLPAILSPGALWGWCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESD 
AAARVTQILSSLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKTWLQS 
KLLPRLPGVPFFSCQRGYKGVWRGDGIMQTTCPCGAQITGHVKNGSMRIVGPRTCSNT 
WHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCP 
CQVPAPEFFTEVDGVRLHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDVAVL 
TSMLTDPSHITAETAKRRLARGSPPSLASSSASQLSAPSLKATCTTRHDSPDADLIEA 
NLLWRQEMGGNITRVESENKWILDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMPI 
WARPDYNPPLLESWKDPDYVPPWHGCPLPPAKAPPIPPPRRKRTWLSESTVSSALA 
ELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDG 
SWSTVSEEASEDWCCSMSYTWTGALITPCAAEETKLPINALSNSLLRHHNLVYATTS 
RSASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSARSK 
FGYGAKDVRNLSSKAVNHIRSVWKDLLEDTETPIDTTIMAKNEVFCVQPEKGGRKPAR 
LIVFPDLGVRVCEKMALYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAWKAKKCPMG 
FAYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLTNSKGQNC 
GYRRCRASGVLTTSCGNTLTCYLKAAAACRAAKLQDCTMLVCGDDLWICESAGTQED 
EASLRAFTEAMTRYSAPPGDPPKPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTT 
PLARAAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIY 
GACYS IEPLDLPQI IQRLHGLSAFSLHS YSPGEINRVASCLRKLGVPPLRVWRHRARS 
VRARLLSQGGRAATCGKYLFNWAVRTKLKLTPIPAASQLDLSSWFVAGYSGGDIYHSL 
SRARPRWFMWCLLLLSVGVGIYLLPNR" 
mat_peptide 1801. .3696 

/gene="NS3" 

/product="NS3 proteinase/helicase" 
mat_peptide 3697. .3858 

^ /gene="NS4A" 

/product= M NS3/4A proteinase cofactor" 
mat_peptide 3859.. 4641 

/gene="NS4B" 

/product="NS4B protein" 
mat_peptide 4642.. 5982 

/gene="NS5A" 
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/product ="NS5A phosphoprotein" 
mat_peptide 5983.. 7755 

/gene= n NS5B" 

/product="NS5B RNA dependant RNA polymerase" 
gene 1801.. 3696 

/gene="NS3 n 
gene 3697.. 3858 

/gene="NS4A" 
gene 3859. .4641 

/gene="NS4B" 
gene 4642 . .5982 

/gene="NS5A" 
gene 5983.. 7755 

/gene="NS5B" 
3 'UTR 7759. .7989 

ORIGIN 

1 gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 
61 tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 
121 cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 
181 gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 
241 gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 
301 gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 
361 ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 
421 cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 
481 ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 
541 acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 
601 cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 
661 tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga ^ 
721 aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 
781 cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 
841 ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 
901 ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 
961 gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 
1021 tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 
1081 ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 
1141 agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 
1201 gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 
1261 cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 
1321 ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 
13 81 aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 
1441 gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 
1501 aaccccccac -ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 
1561 gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 
1621 tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 
1681 atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 
1741 aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 
1801 atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 
1861 agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 
1921 acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 
1981 ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 
2 041 caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 
2101 ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 
2161 ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 
2221 ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 
2281 acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 
2341 cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 
2401 gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 
2461 gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 
2521 gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 
2581 accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 
2641 tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 
2701 atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 
2761 ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 
2821 gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 
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2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 



aaggggggga 
aagctgtccg 
ataccaacta 
ggcgatttcg 
ctggacccga 
cagcggcgag 
gaacggccct 
gcttggtacg 
ccagggttgc 
acccacatag 
ctggtagcat 
caaatgtgga 
tataggctgg 
atggcatgca 
gtcctagcag 
atcatcttgt 
gatgagatgg 
gaacaattca 
gctgctcccg 
tggaatttca 
gcgatagcat 
accctcctgt 
gcttctgctt 
aaggtgcttg 
tttaaggtca 
atcctctccc 
gtgggcccag 
ggtaaccacg 
cagatcctct 
gaggactgct 
acggtgttga 
gtccccttct 
caaaccacct 
atcgtggggc 
accacgggcc 
gctgctgagg 
accactgaca 
gatggggtgc 
acattcctgg 
ccggacgtag 
gctaagcgta 
ctgtctgcgc 
ctcatcgagg 
tcagaaaata 
agggaagtat 
cccatatggg 
tacgtccctc 
cctccacgga 
gagctcgcca 
acggcctctc 
tcctccatgc 
tctaccgtaa 
acaggcgccc 
agcaactctt 
ctgcggcaga 
gtgctcaagg 
gaagcctgta 
gacgtccgga 
ctggaagaca 
gtccaaccag 
gttcgtgtgt 
atgggctctt 
gcctggaaag 
acggtcactg 



ggcacctcat 
gcctcggact 
gcggagacgt 
actcagtgat 
ccttcaccat 
gcaggactgg 
cgggcatgtt 
agctcacgcc 
ccgtctgcca 
acgcccattt 
accaggctac 
agtgtctcat 
gagccgttca 
tgtcggctga 
ctctggccgc 
ccggaaagcc 
aagagtgcgc 
aacagaaggc 
tggtggaatc 
tcagcgggat 
cactgatggc 
ttaacatcct 
tcgtaggcgc 
tggatatttt 
tgagcggcga 
ctggcgccct 
gggagggggc 
tctcccccac 
ctagtcttac 
ccacgccatg 
ctgatttcaa 
tctcatgtca 
gcccatgtgg 
ctaggacctg 
cctgcacgcc 
agtacgtgga 
acgtaaagtg 
ggttgcacag 
tcgggctcaa 
cagtgctcac 
ggctggccag 
cttccttgaa 
ccaacctcct 
aggtagtaat 
ccgttccggc 
cacgcccgga 
cagtggtaca 
ggaagaggac 
caaagacctt 
ctgaccagcc 
ccccccttga 
gcgaggaggc 
tgatcacgcc 
tgctccgtca 
agaaggtcac 
agatgaaggc 
agctgacgcc 
acctatccag 
ctgagacacc 
agaagggggg 
gcgagaaaat 
catacggatt 
cgaagaaatg 
agaatgacat 



tttctgccat 
caatgctgta 
cattgtcgta 
cgactgcaat 
tgagacgacg 
taggggcagg 
cgattcctcg 
cgccgagacc 
ggaccatctg 
cttgtcccag 
ggtgtgcgcc 
acggctaaag 
aaacgaggtt 
cctggaggtc 
gtattgcctg 
ggccatcatt 
ctcacacctc 
aatcgggttg 
caagtggcgg 
acaatattta 
attcacagcc 
ggggggatgg 
cggcatcgct 
ggcaggttat 
gatgccctcc 
agtcgtcggg 
tgtgcagtgg 
gcactatgtg 
catcactcag 
ctccggctcg 
gacctggctc 
acgtgggtac 
agcacagatc 
tagtaacacg 
ctccccggcg 
ggttacgcgg 
cccgtgtcag 
gtacgctcca 
tcaatacctg 
ttccatgctc 
gggatctccc 
ggcaacatgc 
gtggcggcag 
tttggactct 
ggagatcctg 
ttacaaccct 
cgggtgtcca 
ggttgtcctg 
cggcagctcc 
ctccgacgac 
gggggagccg 
tagtgaggac 
atgcgctgcg 
ccacaacttg 
ctttgacaga 
gaaggcgtcc 
cccacattcg 
caaggccgtt 
aattgacacc 
ccgcaagcca 
ggccctttac 
ccaatactct 
ccctatgggc 
ccgtgttgag 



tccaagaaga 
gcatattacc 
gcaacggacg 
acatgtgtca 
accgtgccac 
atgggcattt 
gttctgtgcg 
tcagttaggt 
gagttctggg 
actaagcagg 
agggctcagg 
cctacgctgc 
actaccacac 
gtcacgagca 
acaacaggca 
cccgacaggg 
ccttacatcg 
ctgcaaacag 
accctcgaag 
gcaggcttgt 
tctatcacca 
gtggccgccc 
ggagcggctg 
ggagcagggg 
accgaggacc 
gtcgtgtgcg 
atgaaccggc 
cctgagagcg 
ctgctgaaga 
tggctaagag 
cagtccaagc 
aagggagtct 
accggacatg 
tggcatggaa 
ccaaattatt 
gtgggggatt 
gttccggccc 
gcgtgcaaac 
gttgggtcac 
accgacccct 
ccctccttgg 
actacccgtc 
gagatgggcg 
ttcgagccgc 
cggaggtcca 
ccactgttag 
ttgccgcctg 
tcagaatcta 
gaatcgtcgg 
ggcgacgcgg 
ggggatcccg 
gtcgtctgct 
gaggaaacca 
gtctatgcta 
ctgcaggtcc 
acagttaagg 
gccagatcta 
aaccacatcc 
accatcatgg 
gctcgcctta 
gatgtggtct 
cctggacagc 
ttcgcatatg 
gagtcaatct 



aatgtgatga 
ggggccttga 
ctctaatgac 
cccagacagt 
aagacgcggt 
acaggtttgt 
agtgctatga 
tgcgggctta 
agagcgtctt 
caggagacaa 
ctccacctcc 
acgggccaac 
accccataac 
cctgggtgct 
gcgtggtcat 
aagtccttta 
aacagggaat 
ccaccaagca 
ccttctgggc 
ccactctgcc 
gcccgctcac 
aacttgctcc 
ttggcagcat 
tggcaggcgc 
tggttaacct 
cagcgatact 
tgatagcgtt 
acgctgcagc 
ggcttcacca 
atgtttggga 
tcctgccgcg 
ggcggggcga 
tgaaaaacgg 
cattccccat 
ctagggcgct 
tccactacgt 
ccgaattctt 
ccctcctacg 
agctcccatg 
cccacattac 
ccagctcatc 
atgactcccc 
ggaacatcac 
tccaagcgga 
ggaaattccc 
agtcctggaa 
ccaaggcccc 
ccgtgtcttc 
ccgtcgacag 
gatccgacgt 
atctcagcga 
gctcgatgtc 
agctgcccat 
caacatctcg 
tggacgacca 
ctaaacttct 
aatttggcta 
gctccgtgtg 
caaaaaatga 
tcgtattccc 
ccaccctccc 
gggtcgagtt 
acacccgctg 
accaatgttg 



gctcgccgcg 
tgtatccgtc 
gggctttacc 
cgacttcagc 
gtcacgctcg 
gactccagga 
cgcgggctgt 
cctaaacaca 
tacaggcctc 
cttcccctac 
atcgtgggac 
gcccctgctg 
caaatacatc 
ggtaggcgga 
tgtgggcagg 
ccgggagttc 
gcagctcgcc 
agcggaggct 
gaagcatatg 
tggcaacccc 
cacccaacat 
tcccagcgct 
aggccttggg 
gctcgtggcc 
actccctgct 
gcgtcggcac 
cgcttcgcgg 
acgtgtcact 
gtggatcaac 
ttggatatgc 
attgccggga 
cggcatcatg 
ttccatgagg 
taacgcgtac 
gtggcgggtg 
gacgggcatg 
cacagaagtg 
ggaggaggtc 
cgagcccgaa 
ggcggagacg 
agctagccag 
ggacgctgac 
ccgcgtggag 
ggaggatgag 
tcgagcgatg 
ggacccggac 
tccgatacca 
tgccttggcg 
cggcacggca 
tgagtcgtac 

cgggtcttgg 

ctacacatgg 
caatgcactg 
cagcgcaagc 
ctaccgggac 
atccgtggag 
tggggcaaag 
gaaggacttg 
ggttttctgc 
agatttgggg 
tcaggccgtg 
cctggtgaat 
ttttgactca 
tgacttggcc 
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6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 



// 



cccgaagcca 
actaattcta 
accagctgcg 
aagctccagg 
gcggggaccc 
tctgcccccc 
tcctccaatg 
gaccccacca 
tcctggctag 
actcatttct 
atctacgggg 
catggcctta 
tcatgcctca 
gtccgcgcta 
aactgggcag 
ttatccagct 
gcccgacccc 
ctactcccca 
tttccctttt 
tttttcctct 
tagctgtgaa 
agatcaagt 



gacaggccat 
aagggcagaa 
gtaataccct 
actgcacgat 
aagaggacga 
ctggggaccc 
tgtcagtcgc 
ccccccttgc 
gcaacatcat 
tctccatcct 
cctgttactc 
gcgcattttc 
ggaaacttgg 
ggctactgtc 
taaggaccaa 
ggttcgttgc 
gctggttcat 
accgatgaac 
tttttttctt 
ttttttcctt 
aggtccgtga 



aaggtcgctc 
ctgcggctat 
cacatgttac 
gctcgtatgc 
ggcgagccta 
gcccaaacca 
gcacgatgca 
gcgggctgcg 
catgtatgcg 
tctagctcag 
cattgagcca 
actccatagt 
ggtaccgccc 
ccaggggggg 
gctcaaactc 
tggttacagc 
gtggtgccta 
ggggagctaa 
tttttttttt 
ttctttcctt 
gccgcttgac 



acagagcggc 
cgccggtgcc 
ttgaaggccg 
ggagacgacc 
cgggccttca 
gaatacgact 
tctggcaaaa 
tgggagacag 
cccaccttgt 
gaacaacttg 
cttgacctac 
tactctccag 
ttgcgagtct 
agggctgcca 
actccaatcc 

gggggagaca 

ctcctacttt 
acactccagg 
tttttttttt 
tggtggctcc 
tgcagagagt 



tttacatcgg 
gcgcgagcgg 
ctgcggcctg 
ttgtcgttat 
cggaggctat 
tggagttgat 
gggtgtacta 
ctagacacac 
gggcaaggat 
aaaaagccct 
ctcagatcat 
gtgagatcaa 
ggagacatcg 
cttgtggcaa 
cggctgcgtc 
tatatcacag 
ctgtaggggt 
ccaataggcc 
tttttttttt 
atcttagccc 
gctgatactg 



gggccccctg 
tgtactgacg 
tcgagctgcg 
ctgtgaaagc 
gactagatac 
aacatcatgc 
tctcacccgt 
tccagtcaat 
gatcctgatg 
agattgtcag 
tcaacgactc 
tagggtggct 
ggccagaagt 
gtacctcttc 
ccagttggat 
cctgtctcgt 
aggcatctat 
atcctgtttt 
ttctcctttt 
tagtcacggc 
gcctctctgc 
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KEYWORDS 
SOURCE 
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TITLE 
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SYN 27-JAN-2004 



AB114136 8024 bp DNA linear <. , 

Hepatitis C virus replicon pSGR-JFHl gene for neomysin resistance 
gene product, hepatitis C virus nonstructural protein, complete 
cds . 

AB114136 

AB11413 6. 1 GI =4 0714444 

Hepatitis C virus replicon pSGR-JFHl 
Hepatitis C virus replicon pSGR-JFHl 
artificial sequences; vectors. 

1 

Kato,T., Date,T., Miyamoto, M. , Furusaka,A. , Tokushige, K. , 
Mizokami,M. and Wakita,T. 

Efficient Replication of Hepatitis C Virus Genotype 2a Subgenomic 

Replicon 

Unpublished 

2 (bases 1 to 8024) 

Kato,T. and Wakita,T. 

Direct Submission 

Submitted (04- JUL-2003 ) Takanobu Kato, Nagoya City University 
Graduate School of Medical Sciences, Department of Clinical 
Molecular Informative Medicine; 1, Kawasumi, Mizuho-ku, , Nagoya, 
Aichi 467-8601, Japan (E-mail : ta_kato@med. nagoya -cu . ac . jp, 
Tel:81-52-853-8292, Fax : 81 - 52 - 842 - 002 1 ) 

Location/Qualif iers 

1 . . 8024 

/organism= "Hepatitis C virus replicon pSGR-JFHl" 
/mol_type= "other DNA" 
/ db_xr e f = " t axon : 2 3 7 6 1 7 " 
/ focus 
1. .397 

/organism^ "Hepatitis C virus" 
/mol_type= "other DNA" 

/ isolation_source=" IH/ fulminant hepatitis" 

/ db_xr e f « " t axon : 1 1 1 0 3 " 

/clone= n JFH-l" 

/ count ry = " J apan : Tokyo " 

/ note= " 5 1 UTR 

genotype :2a" 

1207 . . 1767 

/ organism= "Encephalomyocarditis virus " 
/mol_type= n other DNA" 
/db_xref = " taxon : 12104 " 
1774 . .8024 

/organism="Hepatitis C virus" 
/mol_type=" other DNA" 

/isolation_source="IH/ fulminant hepatitis " 

/db_xref =" taxon : 11103 " 

/clone=" JFH-l" 

/ count ry= " Japan : Tokyo " 
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/note="NS3 - 3'X 

genotype : 2a" 
gene 398. .1192 

/gene="NEO" 
CDS 398. .1192 

/gene="NEO" 

/codon_start=l 

/product= n neomysin resistance gene product" 
/protein_id= " BAD06941. 1 " 
/db_xref="GI :40714445" 

/ trans lation= "MIEQDGLHAGSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGR 
PVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDWTEAGRDWLLLGEVPGQDL 
LSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERARTRMEAGLVDQDDLDE 
EHQGLAPAELFARLKARMPDGEDLWTHGDACLPNIMVENGRFSGFIDCGRLGVADRY 
QDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFYRLLDEFF" 

misc_f eature 1207. .1767 

/note="IRES sequence" 

gene 1774 . .7788 

/gene="NS3-NS5b" 

CDS 1774 . .7788 

/gene="NS3-NS5b" 
/codon_start=l 

/product= "hepatitis C virus nonstructural protein" 
/protein_id= " BAD06942 . 1 " 
/db_xref="GI :40714446" 

/ translation="MAPITAYAQQTRGLLGAIWSMTGRDRTEQAGEVQILSTVSQSF 
LGTTISGVLWTVYHGAGNKTLAGLRGPVTQMYSSAEGDLVGWPSPPGTKSLEPCKCGA 
VDLYLVTRNADVIPARRRGDKRGALLSPRPISTLKGSSGGPVLCPRGHWGLFRAAVC 
SRGVAKSIDFIPVETLDWTRSPTFSDNSTPPAVPQTYQVGYLHAPTGSGKSTKVPVA 
YAAQGYKVLVLNPSVAATLGFGAYLSKAHGINPNIRTGVRTVMTGEAITYSTYGKFLA 
DGGCASGAYDI I ICDECHAVDATS ILGIGTVLDQAETAGVRLTVLATATPPGSVTTPH 
PD I EEVGLGREGE I P FYGRAI PLS C I KGGRHL I FCHS KKKCDELAAALRGMGLNAVAY 
YRGLD VS 1 1 P AQGD VVWATD ALMTGYTGD FD S V I DCNVAVTQAVDFS LD PTFT I TTQ 
TVPQDAVSRSQRRGRTGRGRQGTYRYVSTGERASGMFDSWLCECYDAGAAWYDLTPA 
ETTVRLRAYFNTPGLPVCQDHLEFWEAVFTGLTHIDAHFLSQTKQAGENFAYLVAYQA 
TVCARAKAPPPSWDAMWKCLARLKPTLAGPTPLLYRLGPITNEVTLTHPGTKYIATCM 
QADLEVMTSTWVLAGGVLAAVAAYCLATGCVS I IGRLHVNQRVWAPDKEVLYEAFDE 
MEECASRAALIEEGQRIAEMLKSKIQGLLQQASKQAQDIQPAMQASWPKVEQFWARHM 
WNFISGIQYLAGLSTLPGNPAVASMMAFSAALTSPLSTSTTILLNIMGGWLASQIAPP 
AGATGFWSGLVGAAVGSIGLGKVLVDILAGYGAGISGALVAFKIMSGEKPSMEDVIN 
LLPGILSPGALWGVICAAILRRHVGPGEGAVQWMNRLIAFASRGNHVAPTHYVTESD 
ASQRVTQLLGSLTITSLLRRLHNWITEDCPIPCSGSWLRDWDWVCTILTDFKNWLTS 
KLFPKLPGLPFISCQKGYKGVWAGTGIMTTRCPCGANISGNVRLGSMRITGPKTCMNT 
WQGTFPINCYTEGQCAPKPPTNYKTAIWRVAASEYAEVTQHGSYSYVTGLTTDNLKIP 
CQLPSPEFFSWVDGVQIHRFAPTPKPFFRDEVSFCVGLNSYAVGSQLPCEPEPDADVL 
RSMLTDPPHITAETAARRLARGSPPSEASSSVSQLSAPSLRATCTTHSNTYDVDMVDA 
NLLMEGGVAQTEPESRVPVLDFLEPMAEEESDLEPSIPSECMLPRSGFPRALPAWARP 
DYNPPLVESWRRPDYQPPTVAGCALPPPKKAPTPPPRRRRTVGLSESTISEALQQLAI 
KTFGQPPSSGDAGSSTGAGAAESGGPTSPGEPAPSETGSASSMPPLEGEPGDPDLESD 
QVELQPPPQGGGVAPGSGSGSWSTCSEEDDTTVCCSMSYSWTGALITPCSPEEEKLPI 
NPLSNSLLRYHNKVYCTTSKSASQRAKKVTFDRTQVLDAHYDSVLKDIKLAASKVSAR 
LLTLEEACQLTPPHSARSKYGFGAKEVRSLSGRAVNHIKSVWKDLLEDPQTPIPTTIM 
AKNEVFCVI)PAKGGKKPARLIVTPDLGVRVCEKMALYDITQKLPQAVMGASYGFQYSP 
AQRVEYLLKAWAEKKDPMGFSYDTRCFDSTVTERDIRTEESIYQACSLPEEARTAIHS 
LTERLYVGGPMFNSKGQTCGYRRCRASGVLTTSMGNTITCYVKALAACKAAGIVAPTM 
LVCGDDLWISESQGTEEDERNLRAFTEAMTRYSAPPGDPPRPEYDLELITSCSSNVS 
VALGPRGRRRYYLTRDPTTPLARAAWETVRHSPINSWLGNIIQYAPTIWVRMVLMTHF 
FSILMVQDTLDQNLNFEMYGSVYSVNPLDLPAIIERLHGLDAFSMHTYSHHELTRVAS 
ALRKLGAPPLRVWKSRARAVRASLISRGGKAAVCGRYLFNWAVKTKLKLTPLPEARLL 
DLSSWFTVGAGGGDIFHSVSRARPRSLLFGLLLLFVGVGLFLLPAR" 

ORIGIN 

1 acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 
61 cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 
121 ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 
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181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 



aagactgggt 
caagactgct 
cgcttgcgag 
tcaaagaaaa 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 
aagggactgg 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 
ctgtggccgg 
tgctgaagag 
tcccgattcg 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
acgtggtttt 
caaacacgag 
caggccgggg 
tcgggggttt 
ggtccggtca 
cctgggacca 
cggaacgctg 
ccgagaccca 
cacgtcgttg 
ttcatccccg 
acgccaccgg 
ggaaagagca 
aacccctcgg 
aatcccaaca 
acatatggca 
tgcgatgaat 
caagcagaga 
gtgacaaccc 
ttctatggga 
cactcaaaga 
gtggcatact 
gtcgccaccg 
aatgtagcgg 
cagactgtcc 
agacagggca 
gtagtgcttt 
accaccgtca 
cttgaatttt 
caaacaaagc 
gccagagcca 
aagcctacgc 
gtcaccctca 
gtcatgacca 
ctggcgactg 
gcgccggata 
gcggctctca 
ttgctgcagc 
cccaaagtgg 



cctttcttgg 
agccgagtag 
tgccccggga 
accaaaagaa 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
cctttgaaaa 
gcctcctggg 
aagtccaaat 
tgtggactgt 
cgcagatgta 
agtctttgga 
atgtcatccc 
tttcgacctt 
ggctcttccg 
ttgagacact 
ctgtgcccca 
ccaaggtccc 
tagctgccac 
ttaggactgg 
aatttctcgc 
gccacgctgt 
cagccggggt 
cccatcccga 
gggcgattcc 
aaaagtgtga 
atagagggtt 
acgccctcat 
tcacccaagc 
cacaagacgc 
cttataggta 
gtgagtgcta 
ggcttagagc 
gggaggcagt 
aagcggggga 
aggcccctcc 
ttgcgggccc 
cacaccctgg 
gcacgtgggt 
gatgcgtttc 
aggaggtcct 
tcgaagaggg 
aggcctctaa 
aacaattttg 



ataaacccac 
cgttgggttg 
ggtctcgtag 
acaccaaccg 
gggtggagag 
ccgtgttccg 
gtgccctgaa 
ttccttgcgc 
gcgaagtgcc 
tcatggctga 
accaagcgaa 
aggatgatct 
aggcgcgcat 
atatcatggt 
cggaccgcta 
aatgggctga 
ccttctatcg 
aacgttactg 
tccaccatat 
acgagcattc 
gtgaaggaag 
tgcaggcagc 
taagatacac 
gaaagagtca 
gtaccccatt 
tcgaggttaa 
acacgatgat 
cgccatagtg 
cctgtccaca 
ttaccacgga 
ctcgagtgct 
gccgtgcaag 
ggctcggaga 
gaaggggtcc 
agcagctgtg 
cgacgttgtt 
gacctatcag 
tgtcgcgtat 
cctggggttt 
agtcaggacc 
cgatgggggc 
ggatgctacc 
cagactaact 
tatagaagag 
cctatcctgc 
cgagctcgcg 
ggacgtctcc 
gacggggtac 
tgtcgacttc 
tgtctcacgc 
tgtttccact 
cgacgcaggg 
gtatttcaac 
tttcaccggc 
gaacttcgcg 
cccgtcctgg 
cacacctctc 
gacgaagtac 
cctagctgga 
catcatcggc 
gtatgaggct 
gcagcggata 
gcaggcccag 
ggccagacac 



tctatgcccg 
cgaaaggcct 
accgtgcacc 
tcgcccaatg 
gctattcggc 
gctgtcagcg 
tgaactgcag 
agctgtgctc 
ggggcaggat 
tgcaatgcgg 
acatcgcatc 
ggacgaagag 
gcccgacggc 
ggaaaatggc 
tcaggacata 
ccgcttcctc 
ccttcttgac 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaaacgtct 
accatggctc 
gtgagtatga 
gtctctcagt 
gctggcaaca 
gagggggact 
tgtggagccg 
cgcggggaca 
tcgggggggc 
tgctctcggg 
acaaggtctc 
gtcgggtact 
gccgcccagg 
ggggcgtacc 
gtgatgaccg 
tgcgctagcg 
tccattctcg 
gtgctggcta 
gtaggcctcg 
atcaagggag 
gcggcccttc 
ataataccag 
actggagact 
agcctggacc 
agtcagcgcc 
ggtgaacgag 
gctgcgtggt 
acgcccggcc 
ctcacacaca 
tacctagtag 
gacgccatgt 
ctgtaccgtt 
atcgccacat 
ggagtcctgg 
cgcttgcacg 
tttgatgaga 
gccgagatgt 
gacatacaac 
atgtggaact 



gccatttggg 
tgtggtactg 
atgagcacaa 
attgaacaag 
tatgactggg 
caggggcgcc 
gacgaggcag 
gacgttgtca 
ctcctgtcat 
cggctgcata 
gagcgagcac 
catcaggggc 
gaggatctcg 
cgcttttctg 
gcgttggcta 
gtgctttacg 
gagttcttct 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
aggccccccg 
ccatcactgc 
cggggcgtga 
ccttcctcgg 
agactctagc 
tggtaggctg 
tcgacctata 
agcggggagc 
cggtgctctg 
gcgtggccaa 
ccactttcag 
tgcatgctcc 
ggtacaaagt 
tatccaaggc 
gggaggccat 
gcgcctatga 
gcatcggaac 
cggccacacc 
ggcgggaggg 
ggagacacct 
ggggcatggg 
ctcagggaga 
ttgactccgt 
ccaccttcac 
gcgggcgcac 
cctcaggaat 
acgatctcac 
tacccgtgtg 
tagacgccca 
cctaccaagc 
ggaagtgcct 
tgggccctat 
gcatgcaagc 
cagccgtcgc 
tcaaccagcg 
tggaggaatg 
tgaagtccaa 
ccgctatgca 
tcattagcgg 



cgtgcccccg 
cctgataggg 
atcctaaacc 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 
cccgtgatat 
gtatcgccgc 
gagtttaaac 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
aaccacgggg 
ttatgcccag 
caggacagaa 
aacaaccatc 
cggcttacgg 
gcccagcccc 
tctggtcacg 
attgctctcc 
ccctaggggc 
atccatcgat 
tgacaacagc 
aactggcagt 
actagtgctt 
acatggcatc 
cacgtactcc 
catcatcata 
ggtccttgat 
ccccgggtca 
tgagatcccc 
gattttctgc 
cttgaatgcc 
tgtggtggtc 
gatcgactgc 
tataaccaca 
aggtagagga 
gtttgacagt 
accagcggag 
tcaagaccat 
cttcctctcc 
tacggtgtgc 
ggcccgactc 
taccaatgag 
tgaccttgag 
cgcatattgc 
agtcgtcgtt 
cgcctctagg 
gatccaaggc 
ggcttcatgg 
catccaatac 
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4021 ctcgcaggat tgtcaacact gccagggaac 
4081 gccgccctca ccagtccgtt gtcgaccagt 
4141 tggttagcgt cccagatcgc accacccgcg 
4201 gtgggggctg ccgtgggcag cataggcctg 
4261 tatggtgcgg gcatttcggg ggccctcgtc 
4321 tctatggaag atgtcatcaa tctactgcct 
4381 ggggtcatct gcgcggccat tctgcgccgc 
4441 tggatgaaca ggcttattgc ctttgcttcc 
4501 gtgacggagt cggatgcgtc gcagcgtgtg 
4561 agcctactca gaagactcca caattggata 
4621 tcctggctcc gcgacgtgtg ggactgggtt 
4681 ctgacctcta aattgttccc caagctgccc 
4741 tacaagggtg tgtgggccgg cactggcatc 
4801 atctctggca atgtccgcct gggctctatg 
4861 acctggcagg ggacctttcc tatcaattgc 
4921 cccacgaact acaagaccgc catctggagg 
4981 cagcatgggt cgtactccta tgtaacagga 
5041 caactacctt ctccagagtt tttctcctgg 
5101 cccacaccaa agccgttttt ccgggatgag 
5161 gctgtcgggt cccagcttcc ctgtgaacct 
5221 ctaacagatc cgccccacat cacggcggag 
5281 cctccatctg aggcgagctc ctcagtgagc 
5341 tgcaccaccc acagcaacac ctatgacgtg 
5401 ggcggtgtgg ctcagacaga gcctgagtcc 
54 61 atggccgagg aagagagcga ccttgagccc 
5521 agcgggtttc cacgggcctt accggcttgg 
5581 gaatcgtgga ggaggccaga ttaccaaccg 
5641 cccaagaagg ccccgacgcc tcccccaagg 
5701 accatatcag aagccctcca gcaactggcc 
5761 ggtgatgcag gctcgtccac gggggcgggc 
5821 ggtgagccgg ccccctcaga gacaggttcc 
5881 cctggagatc cggacctgga gtctgatcag 
5 941 ggggtagctc ccggttcggg ctcggggtct 
6001 accgtgtgct gctccatgtc atactcctgg 
6061 gaagaggaaa agttgccaat caaccctttg 
6121 gtgtactgta caacatcaaa gagcgcctca 
6181 acgcaagtgc tcgacgccca ttatgactca 
6241 aaggtcagcg caaggctcct caccttggag 
6301 gcaagatcca agtatggatt cggggccaag 
6361 aaccacatca agtccgtgtg gaaggacctc 
6421 accatcatgg ccaaaaatga ggtgttctgc 
6481 gctcgcctca tcgtttaccc tgacctcggc 
6541 gacattacac aaaagcttcc tcaggcggta 
6601 cctgcccaac gggtggagta tctcttgaaa 
6661 ttttcgtatg atacccgatg cttcgactca 
6721 gagtccatat accaggcctg ctccctgccc 
6781 actgagagac tttacgtagg agggcccatg 
6841 agacgttgcc gcgccagcgg ggtgctaacc 
6901 gtgaaagccc tagcggcctg caaggctgcg 
6961 ggcgatgacc tagtagtcat ctcagaaagc 
7021 agagccttca cggaggccat gaccaggtac 
7081 gaatatgacc tggagctaat aacatcctgt 
7141 cggggccgcc gcagatacta cctgaccaga 
7201 tgggaaacag ttagacactc ccctatcaat 
7261 ccaaccatat gggttcgcat ggtcctaatg 
7321 gacaccctgg accagaacct caactttgag 
7381 ttggaccttc cagccataat tgagaggtta 
7441 tactctcacc acgaactgac gcgggtggct 
7501 ctcagggtgt ggaagagtcg ggctcgcgca 
7561 aaagcggccg tttgcggccg atatctcttc 
7621 actccattgc cggaggcgcg cctactggac 
7681 gggggcgaca tttttcacag cgtgtcgcgc 
7741 ctcctacttt tcgtaggggt aggcctcttc 
7801 taggtacact ccatagctaa ctgttccttt 



cccgcggtgg cttccatgat ggcattcagt 
accaccatcc ttctcaacat catgggaggc 
ggggccaccg gctttgtcgt cagtggcctg 
ggtaaggtgc tggtggacat cctggcagga 
gcattcaaga tcatgtctgg cgagaagccc 
gggatcctgt ctccgggagc cctggtggtg 
cacgtgggac cgggggaggg cgcggtccaa 
agaggaaacc acgtcgcccc tactcactac 
acccaactac ttggctctct tactataacc 
actgaggact gccccatccc atgctccgga 
tgcaccatct tgacagactt caaaaattgg 
ggcctcccct tcatctcttg tcaaaagggg 
atgaccacgc gctgcccttg cggcgccaac 
aggatcacag ggcctaaaac ctgcatgaac 
tacacggagg gccagtgcgc gccgaaaccc 
gtggcggcct cggagtacgc ggaggtgacg 
ctgaccactg acaatctgaa aattccttgc 
gtggacggtg tgcagatcca taggtttgca 
gtctcgttct gcgttgggct taattcctat 
gagcccgacg cagacgtatt gaggtccatg 
actgcggcgc ggcgcttggc acggggatca 
cagctatcag caccgtcgct gcgggccacc 
gacatggtcg atgccaacct gctcatggag 
agggtgcccg ttctggactt tctcgagcca 
tcaataccat cggagtgcat gctccccagg 
gcacggcctg actacaaccc gccgctcgtg 
cccaccgttg ctggttgtgc tctccccccc 
agacgccgga cagtgggtct gagcgagagc 
atcaagacct ttggccagcc cccctcgagc 
gccgccgaat ccggcggtcc gacgtcccct 
gcctcctcta tgccccccct cgagggggag 
gtagagcttc aacctccccc ccaggggggg 
tggtctactt gctccgagga ggacgatacc 
accggggctc taataactcc ctgtagcccc 
agtaactcgc tgttgcgata ccataacaag 
cagagggcta aaaaggtaac ttttgacagg 
gtcttaaagg acatcaagct agcggcttcc 
gaggcgtgcc agttgactcc accccattct 
gaggtccgca gcttgtccgg gagggccgtt 
ctggaagacc cacaaacacc aattcccaca 
gtggaccccg ccaagggggg taagaaacca 
gtccgggtct gcgagaaaat ggccctctat 
atgggagctt cctatggctt ccagtactcc 
gcatgggcgg aaaagaagga ccccatgggt 
accgtcactg agagagacat caggaccgag 
gaggaggccc gcactgccat acactcgctg 
ttcaacagca agggtcaaac ctgcggttac 
actagcatgg gtaacaccat cacatgctat 
gggatagttg cgcccacaat gctggtatgc 
caggggactg aggaggacga gcggaacctg 
tctgcccctc ctggtgatcc ccccagaccg 
tcctcaaatg tgtctgtggc gttgggcccg 
gacccaacca ctccactcgc ccgggctgcc 
tcatggctgg gaaacatcat ccagtatgct 
acacacttct tctccattct catggtccaa 
atgtatggat cagtatactc cgtgaatcct 
cacgggcttg acgccttttc tatgcacaca 
tcagccctca gaaaacttgg ggcgccaccc 
gtcagggcgt ccctcatctc ccgtggaggg 
aattgggcgg tgaagaccaa gctcaaactc 
ttatccagtt ggttcaccgt cggcgccggc 
gcccgacccc gctcattact cttcggccta 
ctactccccg ctcggtagag cggcacacac 
tttttttttt tttttttttt tttttttttt 
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7861 tttttttttt cttttttttt tttttccctc tttcttccct tctcatctta ttctactttc 
7921 tttcttggtg gctccatctt agccctagtc acggctagct gtgaaaggtc cgtgagccgc 
7981 atgactgcag agagtgccgt aactggtctc tctgcagatc atgt 
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History Clipboard Details 

! iGet Subsequence- Feat 

Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES . 

source 



8001 bp RNA 
C virus replicon I3 8 9/NS3-3 'UTR. 



linear SYN ll-MAY-2000 



source 



source 

5 'UTR 

gene 

CDS 



SSE242654 
Hepatitis 
AJ242654 

AJ242 654 .1 GI : 544184 0 

core-neo gene; NS2 gene; NS2 proteinase; NS3 gene; NS3 
proteinase/helicase ; NS3/4A proteinase cof actor; NS4A gene; NS4B 
gene; NS4b protein; NS5a gene; NS5A phosphoprotein; NS5b gene; NS5B 
RNA dependant RNA polymerase; polyprotein. 
Hepatitis C virus replicon I389/NS3-3 'UTR 
Hepatitis C virus replicon I389/NS3-3 'UTR 
artificial sequences; vectors. 
1 

Lohmann, V. , Korner, F . , Koch, J. , Herian,U. , Theilmann, L . and 
Bartenschlager , R . 

Replication of subgenomic hepatitis C virus RNAs in a hepatoma cell 
line 

Science 285 (5424), 110-113 (1999) 

99322193 

10390360 

2 (bases 1 to 8001) 
Bartenschlager , R . 
Direct Submission 

Submitted (26-MAY-1999) Bartenschlager R., Institute for Virology, 
Johannes Gutenberg - University Mainz, Obere Zahlbacher Strasse 67, 
55131 Mainz, GERMANY 

Location/Qualifiers 

1. .8001 

/organism= "Hepatitis C virus replicon 13 89/NS3 -3 ' UTR" 
/mol_type= "other RNA" 
/db_xref ="taxon: 95365" 
/ focus 

/note= "Neomycin- selectable bicistronic subgenomic 

hepatitis C virus replicon" 

join(l. .341,1813. .7770) 

/organism="Hepatitis C virus" 

/mol_type= "other RNA" 

/strain="lb" 

/isolate="Conl" 

/db_xref="taxon: 11103" 

/ 1 i s sue_type= " 1 i ver " 

/ count ry = " Ge rmany " 

1202. .1812 

/organism="Encephalomyocarditis virus " 
/mol_type= "other RNA" 
/ db_xr e f = " t axon : 1 2 1 0 4 " 
1. .341 

/evidence=experimental 
342 . . 1193 
/gene= "core-neo" 
342 . . 1193 
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/ gene= " core -neo " 
/function="selectable marker" 
/codon_start=l 

/product="core-neo fusion protein" 
/protein_id= " CAB46916 . 1 " 
/db_xref ="GI :5441841" 

/ translation^ "MSTNPKPQRKTKRNTNGRAMIEQDGLHAGSPAAWVERLFGYDWA 
QQTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDV 
VTEAGRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRI 
ERARTRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLWTHGDACLPNIMVE 
NGRFSGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFY 
RLLDEFF" 

misc_f eature 1202. .1812 

/note=" internal ribosome entry site (IRES) " 

CDS 1813. .7770 

/codon_start=l 

/product^ "non- structural polyprotein" 
/protein_id= " CAB46917 . 1 " 
/db_xref = "GI : 5441842 " 

/translations "MAPITAYSQQTRGLLGCIITSLTGRDRNQVEGEVQWSTATQSF 
LATCVNGVCWTVYHGAGSKTLAGPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGS 
SDLYLVTRHADVIPVRRRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVC 
TRGVAKAVDFVPVESMETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAA 
YAAQGYKVLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTITTGAPITYSTYGKFLA 
DGGCSGGAYDIIICDECHSTDSTTILGIGTVLDQAETAGARLWLATATPPGSVTVPH 
PNIEEVALSSTGEIPFYGKAIPIETIKGGRHLIFCHSKKKCDELAAKLSGLGLNAVAY 
YRGLDVSVIPTSGDVIWATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETT 
TVPQDAVSRSQRRGRTGRGRMGIYRFVTPGERPSGMFDSSVLCECYDAGCAWYELTPA 
ETSVRLRAYLNTPGLPVCQDHLEFWESVFTGLTHIDAHFLSQTKQAGDNFPYLVAYQA 
TVCARAQAPPPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEVTTTHPITKYIMACM 
SADLEWTSTWVLVGGVLAALAAYCLTTGSWIVGRIILSGKPAIIPDREVLYREFDE 
MEECASHLPYIEQGMQLAEQFKQKAIGLLQTATKQAEAAAPWESKWRTLEAFWAKHM 
WNFISGIQYLAGLSTLPGNPAIASLMAFTASITSPLTTQHTLLFNILGGWVAAQLAPP 
SAASAFVGAGIAGAAVGSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVN 
LLPAILSPGALWGWCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESD 
AAARVTQILSSLTITQLLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKTWLQS 
KLLPRLPGVPFFSCQRGYKGVWRGDGIMQTTCPCGAQITGHVKNGSMRIVGPRTCSNT 
WHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCP 
CQVPAPEFFTEVDGVRLHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDVAVL 
TSMLTDPSHITAETAKRRLARGSPPSLASSSASQLSAPSLKATCTTRHDSPDADLIEA 
NLLWRQEMGGNITRVESENKWILDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMPI 
WARPDYNPPLLESWKDPDYVPPWHGCPLPPAKAPPIPPPRRKRTWLSESTVSSALA 
ELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDG 
SWSTVSEEASEDWCCSMSYTWTGALITPCAAEETKLPINALSNSLLRHHNLVYATTS 
RSASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSARSK 
FGYGAKDVRNLSSKAVNHIRSVWKDLLEDTETPIDTTIMAKNEVFCVQPEKGGRKPAR 
LIVFPDLGVRVCEKMALYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAWKAKKCPMG 
FAYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLTNSKGQNC 
GYRRCRASGVLTTSCGNTLTCYLKAAAACRAAKLQDCTMLVCGDDLWICESAGTQED 
EASLRAFTEAMTRYSAPPGDPPKPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTT 
PLARAAWETARHTPVNSWLGNI IMYAPTLWARMILMTHFFS ILLAQEQLEKALDCQI Y 
GACYSIEPLDLPQIIQRLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVWRHRARS 
VRARLLSQGGRAATCGKYLFNWAVRTKLKLTPIPAASQLDLSSWFVAGYSGGDIYHSL 
SRARPRWFMWCLLLLSVGVGI YLLPNR " 
mat_peptide 1813. .3708 

/gene="NS3" 

/product="NS3 proteinase/helicase " 
mat jeptide 3709. .3870 

/gene="NS4A" 

/product="NS3/4A proteinase cofactor" 
mat_peptide 3871. .4653 

/gene="NS4B" 

/product= M NS4B protein" 
mat_peptide 4654.. 5994 
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mat_peptide 

gene 
gene 
gene 
gene 
gene 



3 ' UTR 



/gene="NS5A n 

/product= n NS5A phosphoprotein" 

5995 . .7767 

/gene= n NS5B M 

/product="NS5B RNA dependant RNA polymerase" 

1813 . .3708 

/gene="NS3" 

3709 . .3870 

/gene="NS4A" 

3871 . .4653 

/gene="NS4B" 

4654. .5994 

/gene= n NS5A" 

5995. .7767 

/gene="NS5B" 

7771. .8001 



ORIGIN 



1 gccagccccc 
61 tcttcacgca 
121 cccccctccc 
181 gacgaccggg 
241 gcgagactgc 
301 gtgcttgcga 
361 ctcaaagaaa 
421 acgcaggttc 
481 caatcggctg 
541 ttgtcaagac 
601 cgtggctggc 
661 gaagggactg 
721 ctcctgccga 
cggctacctg 
tggaagccgg 
ccgaactgtt 
atggcgatgc 
actgtggccg 
ttgctgaaga 
ctcccgattc 
cagaccacaa 
aacgttactg 
tccaccatat 
1381 acgagcattc 
1441 gtgaaggaag 
tgcaggcagc 
taagatacac 
gaaagagtca 
gtaccccatt 
tcgaggttaa 
cacgataata 
tgcatcatca 
gtctccaccg 
tatcatggtg 
accaatgtgg 
ccatgcacct 
gtgcgccggc 
aagggctctt 
gctgccgtgt 
2341 gaaaccacta 
2401 acattccagg 
gctgcgtatg 
ctaggtttcg 
gtaaggacca 
gacggtggtt 
gactcgacca 
cgactcgtcg 



781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 



1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 



2461 
2521 
2581 
2641 
2701 
2761 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaacgtcta 
ccatggcgcc 
ctagcctcac 
caacacaatc 
ccggctcaaa 
accaggacct 
gcggcagctc 
ggggcgacag 
cgggcggtcc 
gcacccgagg 
tgcggtcccc 
tggcccatct 
cagcccaagg 
gggcgtatat 
tcaccacggg 
gctctggggg 
ctatcctggg 
tgctcgccac 



gacactccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggctg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
tattacggcc 
aggccgggac 
tttcctggcg 
gacccttgcc 
cgtcggctgg 
ggacctttac 
cagggggagc 
actgctctgc 
ggttgcgaag 
ggtcttcacg 
acacgcccct 
gtataaggtg 
gtctaaggca 
tgcccccatc 
cgcctatgac 
catcggcaca 
cgctacgcct 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
tactcccaac 
aggaaccagg 
acctgcgtca 
ggcccaaagg 
caagcgcccc 
ttggtcacga 
ctactctccc 
ccctcggggc 
gcggtggact 
gacaactcgt 
actggtagcg 
cttgtcctga 
catggtatcg 
acgtactcca 
atcataatat 
gtcctggacc 
ccgggatcgg 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
agacgcgagg 
tcgaggggga 
atggcgtgtg 
gcccaatcac 
ccggggcgcg 
ggcatgccga 
ccaggcccgt 
acgctgtggg 
ttgtacccgt 
cccctccggc 
gcaagagcac 
acccgtccgt 
accctaacat 
cctatggcaa 
gtgatgagtg 
aagcggagac 
tcaccgtgcc 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
cctacttggc 
ggtccaagtg 
ttggactgtc 
ccaaatgtac 
ttccttgaca 
tgtcattccg 
ctcctacttg 
catctttcgg 
cgagtctatg 
cgtaccgcag 
taaggtgccg 
cgccgccacc 
cagaaccggg 
gtttcttgcc 
ccactcaact 
ggctggagcg 
acatccaaac 
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2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 



atcgaggagg 
atcgagacca 
gagctcgccg 
gatgtatccg 
acgggcttta 
gtcgacttca 
gtgtcacgct 
gtgactccag 
gacgcgggct 
tacctaaaca 
tttacaggcc 
aacttcccct 
ccatcgtggg 
acgcccctgc 
accaaataca 
ctggtaggcg 
attgtgggca 
taccgggagt 
atgcagctcg 
caagcggagg 
gcgaagcata 
cctggcaacc 
accacccaac 
cctcccagcg 
ataggccttg 
gcgctcgtgg 
ctactccctg 
ctgcgtcggc 
ttcgcttcgc 
gcacgtgtca 
cagtggatca 
gattggatat 
cgattgccgg 
gacggcatca 
ggttccatga 
attaacgcgt 
ctgtggcggg 
gtgacgggca 
ttcacagaag 
cgggaggagg 
tgcgagcccg 
acggcggaga 
tcagctagcc 
ccggacgctg 
acccgcgtgg 
gaggaggatg 
cctcgagcga 
aaggacccgg 
cctccgatac 
tctgccttgg 
agcggcacgg 
gttgagtcgt 
gacgggtctt 
tcctacacat 
atcaatgcac 
cgcagcgcaa 
cactaccggg 
ctatccgtgg 
tatggggcaa 
tggaaggact 
gaggttttct 
ccagatttgg 
cctcaggccg 
ttcctggtga 



tggctctgtc 
tcaagggggg 
cgaagctgtc 
tcataccaac 
ccggcgattt 
gcctggaccc 
cgcagcggcg 
gagaacggcc 
gtgcttggta 
caccagggtt 
tcacccacat 
acctggtagc 
accaaatgtg 
tgtataggct 
tcatggcatg 
gagtcctagc 
ggatcatctt 
tcgatgagat 
ccgaacaatt 
ctgctgctcc 
tgtggaattt 
ccgcgatagc 
ataccctcct 
ctgcttctgc 
ggaaggtgct 
cctttaaggt 
ctatcctctc 
acgtgggccc 
ggggtaacca 
ctcagatcct 
acgaggactg 
gcacggtgtt 
gagtcccctt 
tgcaaaccac 
ggatcgtggg 
acaccacggg 
tggctgctga 
tgaccactga 
tggatggggt 
tcacattcct 
aaccggacgt 
cggctaagcg 
agctgtctgc 
acctcatcga 
agtcagaaaa 
agagggaagt 
tgcccatatg 
actacgtccc 
cacctccacg 
cggagctcgc 
caacggcctc 
actcctccat 
ggtctaccgt 
ggacaggcgc 
tgagcaactc 
gcctgcggca 
acgtgctcaa 
aggaagcctg 
aggacgtccg 
tgctggaaga 
gcgtccaacc 
gggttcgtgt 
tgatgggctc 
atgcctggaa 



cagcactgga 
gaggcacctc 
cggcctcgga 
tagcggagac 
cgactcagtg 
gaccttcacc 
aggcaggact 
ctcgggcatg 
cgagctcacg 
gcccgtctgc 
agacgcccat 
ataccaggct 
gaagtgtctc 
gggagccgtt 
catgtcggct 
agctctggcc 
gtccggaaag 
ggaagagtgc 
caaacagaag 
cgtggtggaa 
catcagcggg 
atcactgatg 
gtttaacatc 
tttcgtaggc 
tgtggatatt 
catgagcggc 
ccctggcgcc 
aggggagggg 
cgtctccccc 
ctctagtctt 
ctccacgcca 
gactgatttc 
cttctcatgt 
ctgcccatgt 
gcctaggacc 
cccctgcacg 
ggagtacgtg 
caacgtaaag 
gcggttgcac 
ggtcgggctc 
agcagtgctc 
taggctggcc 
gccttccttg 
ggccaacctc 
taaggtagta 
atccgttccg 
ggcacgcccg 
tccagtggta 
gaggaagagg 
cacaaagacc 
tcctgaccag 
gccccccctt 
aagcgaggag 
cctgatcacg 
tttgctccgt 
gaagaaggtc 
ggagatgaag 
taagctgacg 
gaacctatcc 
cactgagaca 
agagaagggg 
gtgcgagaaa 
ttcatacgga 
agcgaagaaa 



gaaatcccct 
attttctgcc 
ctcaatgctg 
gtcattgtcg 
atcgactgca 
attgagacga 
ggtaggggca 
ttcgattcct 
cccgccgaga 
caggaccatc 
ttcttgtccc 
acggtgtgcg 
atacggctaa 
caaaacgagg 
gacctggagg 
gcgtattgcc 
ccggccatca 
gcctcacacc 
gcaatcgggt 
tccaagtggc 
atacaatatt 
gcattcacag 
ctggggggat 
gccggcatcg 
ttggcaggtt 
gagatgccct 
ctagtcgtcg 
gctgtgcagt 
acgcactatg 
accatcactc 
tgctccggct 
aagacctggc 
caacgtgggt 
ggagcacaga 
tgtagtaaca 
ccctccccgg 
gaggttacgc 
tgcccgtgtc 
aggtacgctc 
aatcaatacc 
acttccatgc 
aggggatctc 
aaggcaacat 
ctgtggcggc 
attttggact 
gcggagatcc 
gattacaacc 
cacgggtgtc 
acggttgtcc 
ttcggcagct 
ccctccgacg 
gagggggagc 
gctagtgagg 
ccatgcgctg 
caccacaact 
acctttgaca 
gcgaaggcgt 
cccccacatt 
agcaaggccg 
ccaattgaca 
ggccgcaagc 
atggcccttt 
ttccaatact 
tgccctatgg 



tttatggcaa 
attccaagaa 
tagcatatta 
tagcaacgga 
atacatgtgt 
cgaccgtgcc 
ggatgggcat 
cggttctgtg 
cctcagttag 
tggagttctg 
agactaagca 
ccagggctca 
agcctacgct 
ttactaccac 
tcgtcacgag 
tgacaacagg 
ttcccgacag 
tcccttacat 
tgctgcaaac 
ggaccctcga 
tagcaggctt 
cctctatcac 
gggtggccgc 
ctggagcggc 
atggagcagg 
ccaccgagga 
gggtcgtgtg 
ggatgaaccg 
tgcctgagag 
agctgctgaa 
cgtggctaag 
tccagtccaa 
acaagggagt 
tcaccggaca 
cgtggcatgg 
cgccaaatta 
gggtggggga 
aggttccggc 
cagcgtgcaa 
tggttgggtc 
tcaccgaccc 
ccccctcctt 
gcactacccg 
aggagatggg 
ctttcgagcc 
tgcggaggtc 
ctccactgtt 
cattgccgcc 
tgtcagaatc 
ccgaatcgtc 
acggcgacgc 
cgggggatcc 
acgtcgtctg 
cggaggaaac 
tggtctatgc 
gactgcaggt 
ccacagttaa 
cggccagatc 
ttaaccacat 
ccaccatcat 
cagctcgcct 
acgatgtggt 
ctcctggaca 
gcttcgcata 



agccatcccc 
gaaatgtgat 
ccggggcctt 
cgctctaatg 
cacccagaca 
acaagacgcg 
ttacaggttt 
cgagtgctat 
gttgcgggct 
ggagagcgtc 
ggcaggagac 
ggctccacct 
gcacgggcca 
acaccccata 
cacctgggtg 
cagcgtggtc 
ggaagtcctt 
cgaacaggga 
agccaccaag 
agccttctgg 
gtccactctg 
cagcccgctc 
ccaacttgct 
tgttggcagc 
ggtggcaggc 
cctggttaac 
cgcagcgata 
gctgatagcg 
cgacgctgca 
gaggcttcac 
agatgtttgg 
gctcctgccg 
ctggcggggc 
tgtgaaaaac 
aacattcccc 
ttctagggcg 
tttccactac 
ccccgaattc 
acccctccta 
acagctccca 
ctcccacatt 
ggccagctca 
tcatgactcc 
cgggaacatc 
gctccaagcg 
caggaaattc 
agagtcctgg 
tgccaaggcc 
taccgtgtct 
ggccgtcgac 
gggatccgac 
cgatctcagc 
ctgctcgatg 
caagctgccc 
tacaacatct 
cctggacgac 
ggctaaactt 
taaatttggc 
ccgctccgtg 
ggcaaaaaat 
tatcgtattc 
ctccaccctc 
gcgggtcgag 
tgacacccgc 
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6661 tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 

6721 tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 

6781 gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 

6841 ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 

6901 tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 

6961 atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 

7021 atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 

7081 ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 

7141 tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 

7201 actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 

7261 atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 

7321 ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 

7381 attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 

7441 aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 

7501 cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 

7561 aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 

7621 tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 

7681 agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 

7741 gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 

7801 ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 

7861 ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 

7921 cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 
7981 tggcctctct gcagatcaag t 
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LOCUS SSE242653 8649 bp RNA linear SYN ll-MAY-2000 

DEFINITION Hepatitis C virus replicon I3 8 9/NS2-3 'UTR. 
ACCESSION AJ242653 

VERSION AJ242653.1 01:5441837 

KEYWORDS core-neo gene; NS2 gene; NS2 proteinase; NS3 gene; NS3 

proteinase/helicase; NS3/4A proteinase cofactor; NS4A gene; NS4B 
gene; NS4b protein; NS5a gene; NS5A phosphoprotein; NS5b gene; NS5B 
RNA dependant RNA polymerase; polyprotein. 

SOURCE Hepatitis C virus replicon I389/NS2 -3 ' UTR 

ORGANISM Hepatitis C virus replicon I389/NS2 -3 ' UTR 
artificial sequences; vectors. 

REFERENCE 1 

AUTHORS Lohmann,V., Korner,F., Koch, J., Herian,U., Theilmann, L . and 
Bartenschlager , R . 

TITLE Replication of subgenomic hepatitis C virus RNAs in a hepatoma cell 

line 

JOURNAL Science 285 (5424), 110-113 (1999) 
MEDLINE 99322193 
PUBMED 10390360 
REFERENCE 2 (bases 1 to 8649) 
AUTHORS Bartenschlager , R . 
TITLE Direct Submission 

JOURNAL Submitted (26-MAY-1999) Bartenschlager R., Institute for Virology, 
Johannes Gutenberg - University Mainz, Obere Zahlbacher Strasse 67, 
55131 Mainz, GERMANY 
FEATURES Location/Qualifiers 
source 1 . . 8649 

/organism="Hepatitis C virus replicon 13 89/NS2 - 3 ' UTR" 
/mol_type= "other RNA" 
/ db_xr e f = " t axon : 9 5 3 6 4 " 
/focus 

/note="Neomycin-selectable bicistronic subgenomic 

hepatitis C virus replicon" 
source join(l. .341,1813. .8418) 

/organism="Hepatitis C virus" 

/mol_type= "other RNA" 

/strain="lb" 

/isolate="Conl" 

/db_xref="taxon: 11103" 

/ tissue_type= "liver " 

/ count ry = " Ge rmany " 
source 1202 . . 1812 

/organism^ "Encephalomyocarditis virus" 

/mol_type="other RNA" 

/ db_xr e f = " t axon : 1 2 1 0 4 " 
5 'UTR 1. .341 

/evidence=experimental 
gene 342 . . 1193 

/gene= " core-neo " 
CDS 342 . . 1193 
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/gene= " core-neo" 
/function= M selectable marker" 
/codon_start=l 

/product^" core-neo fusion protein" 
/protein_id=" CAB46914 ,1 " 
/db_xref ="GI :5441838" 

/translations "MSTNPKPQRKTKRNTNGRAMIEQDGLHAGSPAAWVERLFGYDWA 
QQTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDV 
VTEAGRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRI 
ERARTRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLWTHGDACLPNIMVE 
NGRFSGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFY 
RLLDEFF" 

misc_f eature 12 02 . .1812 

/note=" internal ribosome entry site (IRES) " 

CDS 1813 . . 8418 

/codon_start=l 

/product ="non- structural polyprotein" 
/protein_id= " CAB46915 . 1 " 
/db_xref ="GI : 544183 9" 

/ 1 r ans 1 a t i on= " MDREMAAS CGGAVFVGL I LLTLS PH YKLFLARL I WWLQYF I TRA 
EAHLQVWIPPLNVRGGRDAVILLTCAIHPELIFTITKILLAILGPLMVLQAGITKVPY 
FVRAHGLIRACMLVRKVAGGHYVQMALMKLAALTGTYVYDHLTPLRDWAHAGLRDLAV 
AVEPWFSDMETKVITWGADTAACGDIILGLPVSARRGREIHLGPADSLEGQGWRLLA 
PITAYSQQTRGLLGCIITSLTGRDRNQVEGEVQWSTATQSFLATCVNGVCWTVYHGA 
GSKTLAGPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTRHADVIPVR 
RRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVCTRGVAKAVDFVPVESM 
ETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAAYAAQGYKVLVLNPSVA 
ATLGFGAYMSKAHGIDPNIRTGVRTITTGAPITYSTYGKFLADGGCSGGAYDIIICDE 
CHSTDSTTILGIGTVLDQAETAGARLWLATATPPGSVTVPHPNIEEVALSSTGEIPF 
YGKAIPIETIKGGRHLIFCHSKKKCDELAAKLSGLGLNAVAYYRGLDVSVIPTSGDVI 
WATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETTTVPQDAVSRSQRRGRT 
GRGRMGIYRFVTPGERPSGMFDSSVLCECYDAGCAWYELTPAETSVRLRAYLNTPGLP 
VCQDHLEFWESVFTGLTHIDAHFLSQTKQAGDNFPYLVAYQATVCARAQAPPPSWDQM 
WKCLIRLKPTLHGPTPLLYRLGAVQNEVTTTHPITKYIMACMSADLEWTSTWVLVGG 
VLAALAAYCLTTGSWIVGRIILSGKPAIIPDREVLYREFDEMEECASHLPYIEQGMQ 
LAEQFKQKAIGLLQTATKQAEAAAPWESKWRTLEAFWAKHMWNFISGIQYLAGLSTL 
PGNPA I AS LMAFTAS I TS PLTTQHTLL FN I LGGWVAAQLAP P S AAS AF VGAG I AGAAV 
GSIGLGKVLVDILAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPAILSPGALWGW 
CAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTQILSSLTITQ 
LLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKTWLQSKLLPRLPGVPFFSCQR 
GYKGWRGDGIMQTTCPCGAQITGHVKNGSMRIVGPRTCSNTWHGTFPINAYTTGPCT 
PSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFFTEVDGVR 
LHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHITAETAK 
RRLARGSPPSLASSSASQLSAPSLKATCTTRHDSPDADLIEANLLWRQEMGGNITRVE 
SENKWILDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMPIWARPDYNPPLLESWKD 
PDYVPPWHGCPLPPAKAPPIPPPRRKRTWLSESTVSSALAELATKTFGSSESSAVD 
SGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 
SMSYTWTGALITPCAAEETKLPINALSNSLLRHHNLVYATTSRSASLRQKKVTFDRLQ 
VLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSARSKFGYGAKDVRNLSSKAV 
NHIRSVWKDLLEDTETPIDTTI^4AKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMA 
LYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAWKAKKCPMGFAYDTRCFDSTVTEND 
IRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCG 
NTLTCYLKAAAACRAAKLQDCTMLVCGDDLWICESAGTQEDEASLRAFTEAMTRYSA 
PPGDPPKPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPVN 
SWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIQ 
RLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVWRHRARSVRARLLSQGGRAATCG 
KYLFNWAVRTKLKLTPIPAASQLDLSSWFVAGYSGGDIYHSLSRARPRWFMWCLLLLS 
VGVGIYLLPNR" 
mat_peptide 1813. .2463 

/gene="NS2" 

/product="NS2 proteinase" 
mat_peptide 2464.. 4356 

/gene="NS3" 

/product="NS3 proteinase/helicase " 
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mat_peptide 4357.. 4518 

/gene="NS4A M 

/product="NS3/4A proteinase cofactor n 
mat_peptide 4519.. 5301 

/gene="NS4B" 

/product="NS4B protein" 
mat_peptide 5302.. 6642 

/gene="NS5A n 

/product="NS5A phosphoprotein" 
mat_peptide 664 3 . .8415 

/gene="NS5B" 

/product="NS5B RNA dependant RNA polymerase" 
gene 1813 . .2463 

/gene="NS2" 
gene 2464 . .4356 

/gene="NS3" 
gene 4357 . .4518 

/gene="NS4A" 
gene 4519. .5301 

/gene="NS4B" 
gene 5302 . . 6642 

/gene="NS5A" 
gene 6643 . . 8415 

/gene="NS5B" 
3 'UTR 8419. .8649 

ORIGIN 

1 gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 
61 tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 
121 cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 
181 gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 
241 gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 
301 gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 
3 61 ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 
421 acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 
481 caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 
541 ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 
601 cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 
661 gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 
721 ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 
781 cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 
841 tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 
901 ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 
961 atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 
1021 actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 
1081 ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 
1141 ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 
1201 cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 
1261 aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 
1321 tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 
1381 acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 
1441 gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 
1501 tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 
1561 taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 
1621 gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 
1681 gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 
1741 tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 
1801 cacgataata ccatggaccg ggagatggca gcatcgtgcg gaggcgcggt tttcgtaggt 
1861 ctgatactct tgaccttgtc accgcactat aagctgttcc tcgctaggct catatggtgg 
1921 ttacaatatt ttatcaccag ggccgaggca cacttgcaag tgtggatccc ccccctcaac 
1981 gttcgggggg gccgcgatgc cgtcatcctc ctcacgtgcg cgatccaccc agagctaatc 
2041 tttaccatca ccaaaatctt gctcgccata ctcggtccac tcatggtgct ccaggctggt 
2101 ataaccaaag tgccgtactt cgtgcgcgca cacgggctca ttcgtgcatg catgctggtg 
2161 cggaaggttg ctgggggtca ttatgtccaa atggctctca tgaagttggc cgcactgaca 
2221 ggtacgtacg tttatgacca tctcacccca ctgcgggact gggcccacgc gggcctacga 
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2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 



gaccttgcgg 
tggggggcag 
agggggaggg 
ctcgcgccta 
agcctcacag 
acacaatctt 
ggctcaaaga 
caggacctcg 
ggcagctcgg 
ggcgacagca 
ggcggtccac 
acccgagggg 
cggtccccgg 
gcccatctac 
gcccaagggt 
gcgtatatgt 
accacgggtg 
tctgggggcg 
atcctgggca 
ctcgccaccg 
gctctgtcca 
aaggggggga 
aagctgtccg 
ataccaacta 
ggcgatttcg 
ctggacccga 
cagcggcgag 
gaacggccct 
gcttggtacg 
ccagggttgc 
acccacatag 
ctggtagcat 
caaatgtgga 
tataggctgg 
atggcatgca 
gtcctagcag 
atcatcttgt 
gatgagatgg 
gaacaattca 
gctgctcccg 
tggaatttca 
gcgatagcat 
accctcctgt 
gcttctgctt 
aaggtgcttg 
tttaaggtca 
atcctctccc 
gtgggcccag 
ggtaaccacg 
cagatcctct 
gaggactgct 
acggtgttga 
gtccccttct 
caaaccacct 
atcgtggggc 
accacgggcc 
gctgctgagg 
accactgaca 
gatggggtgc 
acattcctgg 
ccggacgtag 
gctaagcgta 
ctgtctgcgc 
ctcatcgagg 



tggcagttga 
acaccgcggc 
agatacatct 
ttacggccta 
gccgggacag 
tcctggcgac 
cccttgccgg 
tcggctggca 
acctttactt 
gggggagcct 
tgctctgccc 
ttgcgaaggc 
tcttcacgga 
acgcccctac 
ataaggtgct 
ctaaggcaca 
cccccatcac 
cctatgacat 
tcggcacagt 
ctacgcctcc 
gcactggaga 
ggcacctcat 
gcctcggact 
gcggagacgt 
actcagtgat 
ccttcaccat 
gcaggactgg 
cgggcatgtt 
agctcacgcc 
ccgtctgcca 
acgcccattt 
accaggctac 
agtgtctcat 
gagccgttca 
tgtcggctga 
ctctggccgc 
ccggaaagcc 
aagagtgcgc 
aacagaaggc 
tggtggaatc 
tcagcgggat 
cactgatggc 
ttaacatcct 
tcgtaggcgc 
tggatatttt 
tgagcggcga 
ctggcgccct 
gggagggggc 
tctcccccac 
ctagtcttac 
ccacgccatg 
ctgatttcaa 
tctcatgtca 
gcccatgtgg 
ctaggacctg 
cctgcacgcc 
agtacgtgga 
acgtaaagtg 
ggttgcacag 
tcgggctcaa 
cagtgctcac 
ggctggccag 
cttccttgaa 
ccaacctcct 



gcccgtcgtc 
gtgtggggac 
gggaccggca 
ctcccaacag 
gaaccaggtc 
ctgcgtcaat 
cccaaagggc 
agcgcccccc 
ggtcacgagg 
actctccccc 
ctcggggcac 
ggtggacttt 
caactcgtcc 
tggtagcggc 
tgtcctgaac 
tggtatcgac 
gtactccacc 
cataatatgt 
cctggaccaa 
gggatcggtc 
aatccccttt 
tttctgccat 
caatgctgta 
cattgtcgta 
cgactgcaat 
tgagacgacg 
taggggcagg 
cgattcctcg 
cgccgagacc 
ggaccatctg 
cttgtcccag 
ggtgtgcgcc 
acggctaaag 
aaacgaggtt 
cctggaggtc 
gtattgcctg 
ggccatcatt 
ctcacacctc 
aatcgggttg 
caagtggcgg 
acaatattta 
attcacagcc 
ggggggatgg 
cggcatcgct 
ggcaggttat 
gatgccctcc 
agtcgtcggg 
tgtgcagtgg 
gcactatgtg 
catcactcag 
ctccggctcg 
gacctggctc 
acgtgggtac 
agcacagatc 
tagtaacacg 
ctccccggcg 
ggttacgcgg 
cccgtgtcag 
gtacgctcca 
tcaatacctg 
ttccatgctc 
gggatctccc 
ggcaacatgc 
gtggcggcag 



ttctctgata 
atcatcttgg 
gacagccttg 
acgcgaggcc 
gagggggagg 
ggcgtgtgtt 
ccaatcaccc 
ggggcgcgtt 
catgccgatg 
aggcccgtct 
gctgtgggca 
gtacccgtcg 
cctccggccg 
aagagcacta 
ccgtccgtcg 
cctaacatca 
tatggcaagt 
gatgagtgcc 
gcggagacgg 
accgtgccac 
tatggcaaag 
tccaagaaga 
gcatattacc 
gcaacggacg 
acatgtgtca 
accgtgccac 
atgggcattt 
gttctgtgcg 
tcagttaggt 
gagttctggg 
actaagcagg 
agggctcagg 
cctacgctgc 
actaccacac 
gtcacgagca 
acaacaggca 
cccgacaggg 
ccttacatcg 
ctgcaaacag 
accctcgaag 
gcaggcttgt 
tctatcacca 
gtggccgccc 
ggagcggctg 
ggagcagggg 
accgaggacc 
gtcgtgtgcg 
atgaaccggc 
cctgagagcg 
ctgctgaaga 
tggctaagag 
cagtccaagc 
aagggagtct 
accggacatg 
tggcatggaa 
ccaaattatt 
gtgggggatt 
gttccggccc 
gcgtgcaaac 
gttgggtcac 
accgacccct 
ccctccttgg 
actacccgtc 
gagatgggcg 



tggagaccaa 
gcctgcccgt 
aagggcaggg 
tacttggctg 
tccaagtggt 
ggactgtcta 
aaatgtacac 
ccttgacacc 
tcattccggt 
cctacttgaa 
tctttcgggc 
agtctatgga 
taccgcagac 
aggtgccggc 
ccgccaccct 
gaaccggggt 
ttcttgccga 
actcaactga 
ctggagcgcg 
atccaaacat 
ccatccccat 
aatgtgatga 
ggggccttga 
ctctaatgac 
cccagacagt 
aagacgcggt 
acaggtttgt 
agtgctatga 
tgcgggctta 
agagcgtctt 
caggagacaa 
ctccacctcc 
acgggccaac 
accccataac 
cctgggtgct 
gcgtggtcat 
aagtccttta 
aacagggaat 
ccaccaagca 
ccttctgggc 
ccactctgcc 
gcccgctcac 
aacttgctcc 
ttggcagcat 
tggcaggcgc 
tggttaacct 
cagcgatact 
tgatagcgtt 
acgctgcagc 
ggcttcacca 
atgtttggga 
tcctgccgcg 
ggcggggcga 
tgaaaaacgg 
cattccccat 
ctagggcgct 
tccactacgt 
ccgaattctt 
ccctcctacg 
agctcccatg 
cccacattac 
ccagctcatc 
atgactcccc 
ggaacatcac 



ggttatcacc 
ctccgcccgc 
gtggcgactc 
catcatcact 
ctccaccgca 
tcatggtgcc 
caatgtggac 
atgcacctgc 
gcgccggcgg 
gggctcttcg 
tgccgtgtgc 
aaccactatg 
attccaggtg 
tgcgtatgca 
aggtttcggg 
aaggaccatc 
cggtggttgc 
ctcgaccact 
actcgtcgtg 
cgaggaggtg 
cgagaccatc 
gctcgccgcg 
tgtatccgtc 
gggctttacc 
cgacttcagc 
gtcacgctcg 
gactccagga 
cgcgggctgt 
cctaaacaca 
tacaggcctc 
cttcccctac 
atcgtgggac 
gcccctgctg 
caaatacatc 
ggtaggcgga 
tgtgggcagg 
ccgggagttc 
gcagctcgcc 
agcggaggct 
gaagcatatg 
tggcaacccc 
cacccaacat 
tcccagcgct 
aggccttggg 
gctcgtggcc 
actccctgct 
gcgtcggcac 
cgcttcgcgg 
acgtgtcact 
gtggatcaac 
ttggatatgc 
attgccggga 
cggcatcatg 
ttccatgagg 
taacgcgtac 
gtggcgggtg 
gacgggcatg 
cacagaagtg 
ggaggaggtc 
cgagcccgaa 
ggcggagacg 
agctagccag 
ggacgctgac 
ccgcgtggag 
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6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 



// 



tcagaaaata 
agggaagtat 
cccatatggg 
tacgtccctc 
cctccacgga 
gagctcgcca 
acggcctctc 
tcctccatgc 
tctaccgtaa 
acaggcgccc 
agcaactctt 
ctgcggcaga 
gtgctcaagg 
gaagcctgta 
gacgtccgga 
ctggaagaca 
gtccaaccag 
gttcgtgtgt 
atgggctctt 
gcctggaaag 
acggtcactg 
cccgaagcca 
actaattcta 
accagctgcg 
aagctccagg 
gcggggaccc 
tctgcccccc 
tcctccaatg 
gaccccacca 
tcctggctag 
actcatttct 
atctacgggg 
catggcctta 
tcatgcctca 
gtccgcgcta 
aactgggcag 
ttatccagct 
gcccgacccc 
ctactcccca 
tttccctttt 
tttttcctct 
tagctgtgaa 
agatcaagt 



aggtagtaat 
ccgttccggc 
cacgcccgga 
cagtggtaca 
ggaagaggac 
caaagacctt 
ctgaccagcc 
ccccccttga 
gcgaggaggc 
tgatcacgcc 
tgctccgtca 
agaaggtcac 
agatgaaggc 
agctgacgcc 
acctatccag 
ctgagacacc 
agaagggggg 
gcgagaaaat 
catacggatt 
cgaagaaatg 
agaatgacat 
gacaggccat 
aagggcagaa 
gtaataccct 
actgcacgat 
aagaggacga 
ctggggaccc 
tgtcagtcgc 
ccccccttgc 
gcaacatcat 
tctccatcct 
cctgttactc 
gcgcattttc 
ggaaacttgg 
ggctactgtc 
taaggaccaa 
ggttcgttgc 
gctggttcat 
accgatgaac 
tttttttctt 
ttttttcctt 
aggtccgtga 



tttggactct 
ggagatcctg 
ttacaaccct 
cgggtgtcca 
ggttgtcctg 
cggcagctcc 
ctccgacgac 
gggggagccg 
tagtgaggac 
atgcgctgcg 
ccacaacttg 
ctttgacaga 
gaaggcgtcc 
cccacattcg 
caaggccgtt 
aattgacacc 
ccgcaagcca 
ggccctttac 
ccaatactct 
ccctatgggc 
ccgtgttgag 
aaggtcgctc 
ctgcggctat 
cacatgttac 
gctcgtatgc 
ggcgagccta 
gcccaaacca 
gcacgatgca 
9C999Ctgcg 
catgtatgcg 
tctagctcag 
cattgagcca 
actccatagt 
ggtaccgccc 
ccaggggggg 
gctcaaactc 
tggttacagc 
gtggtgccta 
ggggagctaa 
tttttttttt 
ttctttcctt 
gccgcttgac 



ttcgagccgc 
cggaggtcca 
ccactgttag 
ttgccgcctg 
tcagaatcta 
gaatcgtcgg 
ggcgacgcgg 
ggggatcccg 
gtcgtctgct 
gaggaaacca 
gtctatgcta 
ctgcaggtcc 
acagttaagg 
gccagatcta 
aaccacatcc 
accatcatgg 
gctcgcctta 
gatgtggtct 
cctggacagc 
ttcgcatatg 
gagtcaatct 
acagagcggc 
cgccggtgcc 
ttgaaggccg 
ggagacgacc 
cgggccttca 
gaatacgact 
tctggcaaaa 
tgggagacag 
cccaccttgt 
gaacaacttg 
cttgacctac 
tactctccag 
ttgcgagtct 
agggctgcca 
actccaatcc 
gggggagaca 
ctcctacttt 
acactccagg 
tttttttttt 
tggtggctcc 
tgcagagagt 



tccaagcgga 
ggaaattccc 
agtcctggaa 
ccaaggcccc 
ccgtgtcttc 
ccgtcgacag 
gatccgacgt 
atctcagcga 
gctcgatgtc 
agctgcccat 
caacatctcg 
tggacgacca 
ctaaacttct 
aatttggcta 
gctccgtgtg 
caaaaaatga 
tcgtattccc 
ccaccctccc 
gggtcgagtt 
acacccgctg 
accaatgttg 
tttacatcgg 
gcgcgagcgg 
ctgcggcctg 
ttgtcgttat 

cggaggctat 

tggagttgat 
gggtgtacta 
ctagacacac 
gggcaaggat 
aaaaagccct 
ctcagatcat 
gtgagatcaa 
ggagacatcg 
cttgtggcaa 
cggctgcgtc 
tatatcacag 
ctgtaggggt 
ccaataggcc 
tttttttttt 
atcttagccc 
gctgatactg 



ggaggatgag 
tcgagcgatg 
ggacccggac 
tccgatacca 
tgccttggcg 
cggcacggca 
tgagtcgtac 
cgggtcttgg 
ctacacatgg 
caatgcactg 
cagcgcaagc 
ctaccgggac 
atccgtggag 
tggggcaaag 
gaaggacttg 
ggttttctgc 
agatttgggg 
tcaggccgtg 
cctggtgaat 
ttttgactca 
tgacttggcc 
gggccccctg 
tgtactgacg 
tcgagctgcg 
ctgtgaaagc 
gactagatac 
aacatcatgc 
tctcacccgt 
tccagtcaat 
gatcctgatg 
agattgtcag 
tcaacgactc 
tagggtggct 
ggccagaagt 
gtacctcttc 
ccagttggat 
cctgtctcgt 
aggcatctat 
atcctgtttt 
ttctcctttt 
tagtcacggc 
gcctctctgc 
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Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



source 



source 

5 'UTR 

gene 

CDS 



SSE242651 8637 bp RNA linear SYN ll-MAY-2000 

Hepatitis C virus replicon I377/NS2-3 'UTR. 

AJ242651 

AJ242 651. 1 GI : 5441831 

core-neo gene; NS2 gene; NS2 proteinase; NS3 gene; NS3 
proteinase/helicase; NS3/4A proteinase cofactor; NS4A gene; NS4B 
gene; NS4b protein; NS5a gene; NS5A phosphoprotein; NS5b gene; NS5B 
RNA dependant RNA polymerase; polyprotein. 
Hepatitis C virus replicon I377/NS2-3 'UTR 
Hepatitis C virus replicon I377/NS2-3 'UTR 
artificial sequences; vectors. 
1 

Lohmann, V. , Korner, F. , Koch, J. , Herian,U. , Theilmann, L. and 
Bartenschlager , R. 

Replication of subgenomic hepatitis C virus RNAs in a hepatoma cell 
line 

Science 285 (5424), 110-113 (1999) 

99322193 

10390360 

2 (bases 1 to 8637) 
Bartenschlager, R. 
Direct Submission 

Submitted (26-MAY-1999) Bartenschlager R. , Institute for Virology, 
Johannes Gutenberg - University Mainz, Obere Zahlbacher Strasse 67, 
55131 Mainz, GERMANY 

Location/Qualifiers 

1. .8637 

/organism="Hepatitis C virus replicon 13 77/ NS 2 - 3 ' UTR " 
/mol_type=" other RNA" 
/ db_xr e f = " t axon : 9 5 3 6 2 » 
/focus 

/note="Neomycin-selectable bicistronic subgenomic 

hepatitis C virus replicon" 

join(l. .341,1801. .8406) 

/organism= "Hepatitis C virus" 

/mol_type= "other RNA" 

/strain="lb" 

/isolate="Conl" 

/db_xre f = " t axon : 1 1 1 0 3 " 

/ 1 i s sue_type = " 1 i ve r " 

/country= "Germany" 

1190. . 1800 

/organism="Encephalomyocarditis virus " 
/mol_type="other RNA" 
/db_xr e f = " t axon : 1 2 1 0 4 " 
1. .341 

/evidence=experimental 
342 . . 1181 
/gene= " core-neo " 
342. .1181 
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/gene= " core-neo" 
/function="selectable marker" 
/codon_start=l 

/product="core-neo fusion protein" 
/protein_id= , ' CAB46910 ,1 " 
/db_xref="GI :5441832" 

/ translation= "MSTNPKPQRKTKGRAMIEQDGLHAGSPAAWVERLFGYDWAQQTI 
GCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAVLDWTEA 
GRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERAR 
TRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLWTHGDACLPNIMVENGRF 
SGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAPDSQRIAFYRLLD 
EFF" 

misc_feature 1190 . .18 0 0 

/note=" internal ribosome entry site (JRES) " 
CDS 1801. .8406 

/codon_start=l 

/product ="non- structural polyprotein" 
/protein_id= " CAB46911.1 " 
/db_xref ="GI : 5441833" 

/ t r ans 1 a t i on= " MDREMAAS CGGAVFVGL I LLTLS PHYKLFLARL I WWLQYF I TRA 
EAHLQVWIPPLNVRGGRDAVILLTCAIHPELIFTITKILLAILGPLMVLQAGITKVPY 
FVRAHGL I RACMLVRKVAGGHYVQMALMKLAALTGTYVYDHLTPLRDWAHAGLRDLAV 
AVEPWFSDMETKVITWGADTAACGDIILGLPVSARRGREIHLGPADSLEGQGWRLLA 
PITAYSQQTRGLLGCIITSLTGRDRNQVEGEVQWSTATQSFLATCVNGVCWTVYHGA 
GSKTLAGPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTRHADVIPVR 
RRGDSRGSLLSPRPVSYLKGSSGGPLLCPSGHAVGIFRAAVCTRGVAKAVDFVPVESM 
ETTMRSPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAAYAAQGYKVLVLNPSVA 
ATLGFGAYMS KAHG I D PN I RTGVRT I TTGAP I T YS T YGKFL ADGGC S GG AYD III CDE 
CHSTDSTTILGIGTVLDQAETAGARLWLATATPPGSVTVPHPNIEEVALSSTGEIPF 
YGKAIP I ET I KGGRHL I FCHS KKKCDELAAKLS GLGLNAVAY YRGLDVS VI PTS GD V I 
WATDALMTGFTGDFDSVIDCNTCVTQTVDFSLDPTFTIETTTVPQDAVSRSQRRGRT 
GRGRMGIYRFVTPGERPSGMFDSSVLCECYDAGCAWYELTPAETSVRLRAYLNTPGLP 
VCQDHLEFWESVFTGLTHIDAHFLSQTKQAGDNFPYLVAYQATVCARAQAPPPSWDQM 
WKCLIRLKPTLHGPTPLLYRLGAVQNEVTTTHPITKYIMACMSADLEWTSTWVLVGG 
VLAALAAYCLTTGSWI VGRI ILSGKPAI IPDREVLYREFDEMEECASHLPYIEQGMQ 
LAEQFKQKAIGLLQTATKQAEAAAPWESKWRTLEAFWAKHMWNFISGIQYLAGLSTL 
PGNPAIASLMAFTASITSPLTTQHTLLFNILGGWVAAQLAPPSAASAFVGAGIAGAAV 
GS IGLGKVLVD I LAGYGAGVAGALVAFKVMSGEMPSTEDLVNLLPAI LS PGALVVGVV 
CAAI LRRHVGPGEGAVQWMNRL I AFASRGNHVS PTHYVPE SDAAARVTQ ILSSLTITQ 
LLKRLHQWINEDCSTPCSGSWLRDVWDWICTVLTDFKTWLQSKLLPRLPGVPFFSCQR 
GYKGVWRGDGIMQTTCPCGAQITGHVKNGSMRIVGPRTCSNTWHGTFPINAYTTGPCT 
PSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFFTEVDGVR 
LHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDVAVLTSMLTDPSHITAETAK 
RRLARGSPPSLASSSASQLSAPSLKATCTTRHDSPDADLIEANLLWRQEMGGNITRVE 
SENKWILDSFEPLQAEEDEREVSVPAEILRRSRKFPRAMPIWARPDYNPPLLESWKD 
PDYVPPWHGCPLPPAKAPPIPPPRRKRTWLSESTVSSALAELATKTFGSSESSAVD 
SGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCC 
SMSYTWTGALITPCAAEETKLPINALSNSLLRHHNLVYATTSRSASLRQKKVTFDRLQ 
VLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSARSKFGYGAKDVRNLSSKAV 
NHIRSVWKDLLEDTETPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMA 
LYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAWKAKKCPMGFAYDTRCFDSTVTEND 
IRVEESIYQCCDLAPEARQAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCG 
NTLTCYLKAAAACRAAKLQDCTMLVCGDDLWICESAGTQEDEASLRAFTEAMTRYSA 
PPGDPPKPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPVN 
SWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIQ 
RLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVWRHRARSVRARLLSQGGRAATCG 
KYLFNWAVRTKLKLTPIPAASQLDLSSWFVAGYSGGDIYHSLSRARPRWFMWCLLLLS 
VGVGI YLLPNR " 
mat jeptide 1801. .2451 

/gene="NS2" 

/product="NS2 proteinase" 
matjeptide 2452.. 4344 

/gene="NS3" 

/product="NS3 proteinase/helicase " 
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mat 


peptide 


4345. .4506 






/gene='»NS4A" 






/product="NS3/4A proteinase cof actor" 


mat 


^peptide 


4507. .5289 






/gene="NS4B" 






/product="NS4B protein" 


mat 


^peptide 


5290. .6630 






/gene= H NS5A" 






/product="NS5A phosphoprotein" 


mat 


peptide 


6631. .8403 



/gene="NS5B" 

/product="NS5B RNA dependant RNA polymerase" 
gene 1801. .2451 

/gene= n NS2" 
gene 2452 . .4344 

/gene="NS3" 
gene 4345 . . 4506 

/gene="NS4A" 
gene 4507 . .5289 

/gene= n NS4B M 
gene 5290 . . 6630 

/gene="NS5A" 
gene 6631. . 8403 

/gene= n NS5B M 
3 'UTR 8407. .8637 

ORIGIN 

1 gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 

61 tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 

121 cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 

181 gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 

241 gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 

3 01 gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 
361 ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 
421 cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 

4 81 ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 
541 acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 
6 01 cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 
661 tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 
721 aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 
781 cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 
841 ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 
901 ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 
961 gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 

1021 tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 

1081 ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 

1141 agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 

1201 gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 

12 61 cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 
1321 ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 

13 81 aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 
1441 gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 
1501 aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 
1561 gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 
1621 tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 
1681 atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 
1741 aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 
1801 atggaccggg agatggcagc atcgtgcgga ggcgcggttt tcgtaggtct gatactcttg 
1861 accttgtcac cgcactataa gctgttcctc gctaggctca tatggtggtt acaatatttt 
1921 atcaccaggg ccgaggcaca cttgcaagtg tggatccccc ccctcaacgt tcgggggggc 
1981 cgcgatgccg tcatcctcct cacgtgcgcg atccacccag agctaatctt taccatcacc 
2041 aaaatcttgc tcgccatact cggtccactc atggtgctcc aggctggtat aaccaaagtg 
2101 ccgtacttcg tgcgcgcaca cgggctcatt cgtgcatgca tgctggtgcg gaaggttgct 
2161 gggggtcatt atgtccaaat ggctctcatg aagttggccg cactgacagg tacgtacgtt 
2221 tatgaccatc tcaccccact gcgggactgg gcccacgcgg gcctacgaga ccttgcggtg 
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2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 



gcagttgagc 
accgcggcgt 
atacatctgg 
acggcctact 
cgggacagga 
ctggcgacct 
cttgccggcc 
ggctggcaag 
ctttacttgg 
gggagcctac 
ctctgcccct 
gcgaaggcgg 
ttcacggaca 
gcccctactg 
aaggtgcttg 
aaggcacatg 
cccatcacgt 
tatgacatca 
ggcacagtcc 
acgcctccgg 
actggagaaa 
cacctcattt 
ctcggactca 
ggagacgtca 
tcagtgatcg 
ttcaccattg 
aggactggta 
ggcatgttcg 
ctcacgcccg 
gtctgccagg 
gcccatttct 
caggctacgg 
tgtctcatac 
gccgttcaaa 
tcggctgacc 
ctggccgcgt 
ggaaagccgg 
gagtgcgcct 
cagaaggcaa 
gtggaatcca 
agcgggatac 
ctgatggcat 
aacatcctgg 
gtaggcgccg 
gatattttgg 
agcggcgaga 
ggcgccctag 
gagggggctg 
tcccccacgc 
agtcttacca 
acgccatgct 
gatttcaaga 
tcatgtcaac 
ccatgtggag 
aggacctgta 
tgcacgccct 
tacgtggagg 
gtaaagtgcc 
ttgcacaggt 
gggctcaatc 
gtgctcactt 
ctggccaggg 
tccttgaagg 
aacctcctgt 



ccgtcgtctt 
gtggggacat 
gaccggcaga 
cccaacagac 
accaggtcga 
gcgtcaatgg 
caaagggccc 
cgccccccgg 
tcacgaggca 
tctcccccag 
cggggcacgc 
tggactttgt 
actcgtcccc 
gtagcggcaa 
tcctgaaccc 
gtatcgaccc 
actccaccta 
taatatgtga 
tggaccaagc 
gatcggtcac 
tcccctttta 
tctgccattc 
atgctgtagc 
ttgtcgtagc 
actgcaatac 
agacgacgac 
ggggcaggat 
attcctcggt 
ccgagacctc 
accatctgga 
tgtcccagac 
tgtgcgccag 
ggctaaagcc 
acgaggttac 
tggaggtcgt 
attgcctgac 
ccatcattcc 
cacacctccc 
tcgggttgct 
agtggcggac 
aatatttagc 
tcacagcctc 
ggggatgggt 
gcatcgctgg 
caggttatgg 
tgccctccac 
tcgtcggggt 
tgcagtggat 
actatgtgcc 
tcactcagct 
ccggctcgtg 
cctggctcca 
gtgggtacaa 
cacagatcac 
gtaacacgtg 
ccccggcgcc 
ttacgcgggt 
cgtgtcaggt 
acgctccagc 
aatacctggt 
ccatgctcac 
gatctccccc 
caacatgcac 
ggcggcagga 



ctctgatatg 
catcttgggc 
cagccttgaa 
gcgaggccta 
gggggaggtc 
cgtgtgttgg 
aatcacccaa 
ggcgcgttcc 
tgccgatgtc 
gcccgtctcc 
tgtgggcatc 
acccgtcgag 
tccggccgta 
gagcactaag 
gtccgtcgcc 
taacatcaga 
tggcaagttt 
tgagtgccac 
ggagacggct 
cgtgccacat 
tggcaaagcc 
caagaagaaa 
atattaccgg 
aacggacgct 
atgtgtcacc 
cgtgccacaa 
gggcatttac 
tctgtgcgag 
agttaggttg 
gttctgggag 
taagcaggca 
ggctcaggct 
tacgctgcac 
taccacacac 
cacgagcacc 
aacaggcagc 
cgacagggaa 
ttacatcgaa 
gcaaacagcc 
cctcgaagcc 
aggcttgtcc 
tatcaccagc 
ggccgcccaa 
agcggctgtt 
agcaggggtg 
cgaggacctg 
cgtgtgcgca 
gaaccggctg 
tgagagcgac 
gctgaagagg 
gctaagagat 
gtccaagctc 
gggagtctgg 
cggacatgtg 
gcatggaaca 
aaattattct 

gggggatttc 

tccggccccc 
gtgcaaaccc 
tgggtcacag 
cgacccctcc 
ctccttggcc 
tacccgtcat 
gatgggcggg 



gagaccaagg 
ctgcccgtct 
gggcaggggt 
cttggctgca 
caagtggtct 
actgtctatc 
atgtacacca 
ttgacaccat 
attccggtgc 
tacttgaagg 
tttcgggctg 
tctatggaaa 
ccgcagacat 
gtgccggctg 
gccaccctag 
accggggtaa 
cttgccgacg 
tcaactgact 
ggagcgcgac 
ccaaacatcg 
atccccatcg 
tgtgatgagc 
ggccttgatg 
ctaatgacgg 
cagacagtcg 
gacgcggtgt 
aggtttgtga 
tgctatgacg 
cgggcttacc 
agcgtcttta 
ggagacaact 
ccacctccat 
gggccaacgc 
cccataacca 
tgggtgctgg 
gtggtcattg 
gtcctttacc 
cagggaatgc 
accaagcaag 
ttctgggcga 
actctgcctg 
ccgctcacca 
cttgctcctc 
ggcagcatag 
gcaggcgcgc 
gttaacctac 
gcgatactgc 
atagcgttcg 
gctgcagcac 
cttcaccagt 
gtttgggatt 
ctgccgcgat 
cggggcgacg 
aaaaacggtt 
ttccccatta 
agggcgctgt 
cactacgtga 
gaattcttca 
ctcctacggg 
ctcccatgcg 
cacattacgg 
agctcatcag 
gactccccgg 
aacatcaccc 



ttatcacctg 
ccgcccgcag 
ggcgactcct 
tcatcactag 
ccaccgcaac 
atggtgccgg 
atgtggacca 
gcacctgcgg 
gccggcgggg 
gctcttcggg 
ccgtgtgcac 
ccactatgcg 
tccaggtggc 
cgtatgcagc 
gtttcggggc 
ggaccatcac 
gtggttgctc 
cgaccactat 
tcgtcgtgct 
aggaggtggc 
agaccatcaa 
tcgccgcgaa 
tatccgtcat 
gctttaccgg 
acttcagcct 
cacgctcgca 
ctccaggaga 
cgggctgtgc 
taaacacacc 
caggcctcac 
tcccctacct 
cgtgggacca 
ccctgctgta 
aatacatcat 
taggcggagt 
tgggcaggat 
gggagttcga 
agctcgccga 
cggaggctgc 
agcatatgtg 
gcaaccccgc 
cccaacatac 
ccagcgctgc 
gccttgggaa 
tcgtggcctt 
tccctgctat 
gtcggcacgt 
cttcgcgggg 
gtgtcactca 
ggatcaacga 
ggatatgcac 
tgccgggagt 
gcatcatgca 
ccatgaggat 
acgcgtacac 
ggcgggtggc 
cgggcatgac 
cagaagtgga 
aggaggtcac 
agcccgaacc 
cggagacggc 
ctagccagct 
acgctgacct 
gcgtggagtc 



gggggcagac 
ggggagggag 
cgcgcctatt 
cctcacaggc 
acaatctttc 
ctcaaagacc 
ggacctcgtc 
cagctcggac 
cgacagcagg 
cggtccactg 
ccgaggggtt 
gtccccggtc 
ccatctacac 
ccaagggtat 
gtatatgtct 
cacgggtgcc 
tgggggcgcc 
cctgggcatc 
cgccaccgct 
tctgtccagc 
gggggggagg 
gctgtccggc 
accaactagc 
cgatttcgac 
ggacccgacc 
gcggcgaggc 
acggccctcg 
ttggtacgag 
agggttgccc 
ccacatagac 
ggtagcatac 
aatgtggaag 
taggctggga 
ggcatgcatg 
cctagcagct 
catcttgtcc 
tgagatggaa 
acaattcaaa 
tgctcccgtg 
gaatttcatc 
gatagcatca 
cctcctgttt 
ttctgctttc 
ggtgcttgtg 
taaggtcatg 
cctctcccct 
gggcccaggg 
taaccacgtc 
gatcctctct 
ggactgctcc 
ggtgttgact 
ccccttcttc 
aaccacctgc 
cgtggggcct 
cacgggcccc 
tgctgaggag 
cactgacaac 
tggggtgcgg 
attcctggtc 
ggacgtagca 
taagcgtagg 
gtctgcgcct 
catcgaggcc 
agaaaataag 
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6121 gtagtaattt tggactcttt cgagccgctc 
6181 gttccggcgg agatcctgcg gaggtccagg 
6241 cgcccggatt acaaccctcc actgttagag 
6301 gtggtacacg ggtgtccatt gccgcctgcc 
6361 aagaggacgg ttgtcctgtc agaatctacc 
6421 aagaccttcg gcagctccga atcgtcggcc 
6481 gaccagccct ccgacgacgg cgacgcggga 
6541 ccccttgagg gggagccggg ggatcccgat 
6601 gaggaggcta gtgaggacgt cgtctgctgc 
6661 atcacgccat gcgctgcgga ggaaaccaag 
6721 ctccgtcacc acaacttggt ctatgctaca 
6781 aaggtcacct ttgacagact gcaggtcctg 
6841 atgaaggcga aggcgtccac agttaaggct 
6901 ctgacgcccc cacattcggc cagatctaaa 
6961 ctatccagca aggccgttaa ccacatccgc 
7021 gagacaccaa ttgacaccac catcatggca 
7081 aaggggggcc gcaagccagc tcgccttatc 
7141 gagaaaatgg ccctttacga tgtggtctcc 
7201 tacggattcc aatactctcc tggacagcgg 
7261 aagaaatgcc ctatgggctt cgcatatgac 
7321 aatgacatcc gtgttgagga gtcaatctac 
7381 caggccataa ggtcgctcac agagcggctt 
7441 gggcagaact gcggctatcg ccggtgccgc 
7501 aataccctca catgttactt gaaggccgct 
7561 tgcacgatgc tcgtatgcgg agacgacctt 
7621 gaggacgagg cgagcctacg ggccttcacg 
7681 ggggacccgc ccaaaccaga atacgacttg 
7741 tcagtcgcgc acgatgcatc tggcaaaagg 
7801 ccccttgcgc gggctgcgtg ggagacagct 
7861 aacatcatca tgtatgcgcc caccttgtgg 
7921 tccatccttc tagctcagga acaacttgaa 
7981 tgttactcca ttgagccact tgacctacct 
8041 gcattttcac tccatagtta ctctccaggt 
8101 aaacttgggg taccgccctt gcgagtctgg 
8161 ctactgtccc agggggggag ggctgccact 
8221 aggaccaagc tcaaactcac tccaatcccg 
8281 ttcgttgctg gttacagcgg gggagacata 
8341 tggttcatgt ggtgcctact cctactttct 
8401 cgatgaacgg ggagctaaac actccaggcc 
8461 tttttctttt tttttttttt tttttttttt 
8521 ttttcctttt ctttcctttg gtggctccat 
8581 gtccgtgagc cgcttgactg cagagagtgc 



caagcggagg aggatgagag ggaagtatcc 
aaattccctc gagcgatgcc catatgggca 
tcctggaagg acccggacta cgtccctcca 
aaggcccctc cgataccacc tccacggagg 
gtgtcttctg ccttggcgga gctcgccaca 
gtcgacagcg gcacggcaac ggcctctcct 
tccgacgttg agtcgtactc ctccatgccc 
ctcagcgacg ggtcttggtc taccgtaagc 
tcgatgtcct acacatggac aggcgccctg 
ctgcccatca atgcactgag caactctttg 
acatctcgca gcgcaagcct gcggcagaag 
gacgaccact accgggacgt gctcaaggag 
aaacttctat ccgtggagga agcctgtaag 
tttggctatg gggcaaagga cgtccggaac 
tccgtgtgga aggacttgct ggaagacact 
aaaaatgagg ttttctgcgt ccaaccagag 
gtattcccag atttgggggt tcgtgtgtgc 
accctccctc aggccgtgat gggctcttca 
gtcgagttcc tggtgaatgc ctggaaagcg 
acccgctgtt ttgactcaac ggtcactgag 
caatgttgtg acttggcccc cgaagccaga 
tacatcgggg gccccctgac taattctaaa 
gcgagcggtg tactgacgac cagctgcggt 
gcggcctgtc gagctgcgaa gctccaggac 
gtcgttatct gtgaaagcgc ggggacccaa 
gaggctatga ctagatactc tgccccccct 
gagttgataa catcatgctc ctccaatgtg 
gtgtactatc tcacccgtga ccccaccacc 
agacacactc cagtcaattc ctggctaggc 
gcaaggatga tcctgatgac tcatttcttc 
aaagccctag attgtcagat ctacggggcc 
cagatcattc aacgactcca tggccttagc 
gagatcaata gggtggcttc atgcctcagg 
agacatcggg ccagaagtgt ccgcgctagg 
tgtggcaagt acctcttcaa ctgggcagta 
gctgcgtccc agttggattt atccagctgg 
tatcacagcc tgtctcgtgc ccgaccccgc 
gtaggggtag gcatctatct actccccaac 
aataggccat cctgtttttt tccctttttt 
tttttttttt ctcctttttt tttcctcttt 
cttagcccta gtcacggcta gctgtgaaag 
tgatactggc ctctctgcag atcaagt 
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